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Preface 



The derivation of the equations of motion for extended bodies represents a 
fundamental problem in relativistic gravitational physics. Since the early days of 
Einstein’s theory of General Relativity, a wide spectrum of methods has been 
proposed to allow for the derivation in dilferent physical settings. Without having 
such equations of motion at hand it is impossible to correctly describe, for example, 
the motion of binary systems, or to calculate the form of gravitational waves 
emitted by such systems. These equations are also crucial for astrometry and for 
high precision satellite missions. 

In 2013 we organized^ an international conference in Bad Honnef (Germany) on 
the “Equations of Motion in Relativistic Gravity”. The conference brought together 
the leading experts in their respective fields, and was very well received by the 
speakers as well as by the audience. We would like to thank the WE-Heraeus 
Eoundation for the generous support of this conference. Our thanks also go to the 
Physikzentrum Bad Honnef where the conference took place. 

The very positive reception and the feedback after the conference made clear that 
there is a strong demand for an up-to-date textbook, covering the wide spectrum of 
methods employed in current research in the context of the relativistic problem of 
motion. This book intends to give a broad and up-do-date status report on this issue. 
It hopefully is of value for the experts working in this field and may also serve as a 
guideline for students with a background in General Relativity, who would like to 
enter the field. 

The present volume is based on the lectures given at the conference, and gives an 
overview of the following topics: 

• Derivation of the equations of motion in General Relativity (GR) and in alter- 
native gravity theories for 

- Spinning and extended test bodies 

- Self-force effects 



^http ://puetzfeld.org/eom20 1 3 .html. 
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- Self-gravitating/heavy bodies 

- Micro structured bodies with internal degrees of freedom 

• Approximation methods linked to the derivation of the equations of motion 

- Multipolar methods 

- Post-Newtonian and post-Minkowskian methods 

• Current and future observations 

- Binary systems, in particular binary pulsars 

- Gravitational waves 

- Celestial mechanics and astrometry 

- Satellite experiments 

The contributions in this book roughly follow the order of presentations at the 
conference, i.e. there are contributions on test bodies, self-force effects, the 
dynamics of self-gravitating bodies, as well as on current and future observations. 

We as editors are deeply indebted to the contributors to this volume, who made 
great efforts to present their respective areas of research in an accessible way to a 
broader audience. We hope that the material presented here will prove to be useful 
as a reference for experienced researchers, as well as serve as an inspiration for 
younger researchers who want to enter this exciting field of gravitational physics. 



Dirk Puetzfeld 
Claus Lammerzahl 
Bernard Schutz 
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The New Mechanics of Myron Mathisson 
and Its Subsequent Development 



W.G. Dixon 



Abstract In 1937, Myron Mathisson published a paper which initiated a program of 
work in general relativity that continues to the present day. The aim of this program 
was to obtain equations that determine the trajectory of an extended body in an 
external gravitational field. In Newtonian mechanics this is a straightforward task, 
but in general relativity even giving a precise meaning to the problem is fraught with 
difficulties. These difficulties are analysed, Mathisson’ s approach is described and it 
is shown how his approach has been carried to fulfillment by subsequent authors in 
the years since Mathis son’s untimely death in 1940. This work, however, completes 
only a part of the overall program. The work is placed in context in this overall 
program and the issues remaining for that program are identified. 



1 Introduction 

In 1937, Myron Mathisson published a paper [1] in German in the Polish journal 
Acta Physica Polonica with the ambitious title, in translation, of A New Mechanics of 
Material Systems. Long neglected, this paper was finally made available in English 
translation in 2010 [2]. This was the first paper to consider how to describe an 
extended body in general relativity in a manner similar to that of Newtonian rigid 
body mechanics. 

It was already known that a point particle with no internal structure follows a 
geodesic path. This had originally been a postulate of general relativity additional 
to the vacuum field equations, but Einstein and Grommer [3] had shown it to be 
a consequence of those equations and so not needed as a separate postulate. Their 
derivation treated the particle as a singularity in spacetime. The path followed by the 
particle was a geodesic in the background metric from which the field of the singu- 
larity itself had been subtracted. Einstein published a second paper [4] later that year, 
extending the work to include the presence of an electromagnetic field. The approach 
was dictated by Einstein’s philosophical beliefs: the particles being considered are 
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the elementary particles of which all matter is composed, the vacuum field equations 
will break down in the immediate vicinity of such a particle, but as nothing is known 
about the manner of this breakdown, the best that can be done is provisionally to 
treat such particles as singularities in the gravitational and electromagnetic fields. 

This philosophical view of Einstein causes problems when the question of interest 
is the motion of astronomical bodies, which is the only arena in which general 
relativity had been tested at that time, or indeed is likely to be tested in the foreseeable 
future. To treat such a body truly within Einstein’s philosophy requires treating it 
as an assembly of elementary particles, each of which is represented as a spacetime 
singularity. In effect this requires the development of a general relativistic version of 
statistical mechanics, a formidable task. 

The alternative is to say that, on an astronomical scale, the individual bodies to be 
considered are sufficiently small that they can themselves be treated as particles. This 
may seem plausible but it requires proof. A derivation of the geodesic postulate from 
this viewpoint was given first by Robertson [5], who expressed his philosophy thus: 

In the general theory of relativity, as in many other branches of theoretical physics, the 
material and energetical content of space-time is considered, in the first instance, as an 
extended field, which is specified by means of field quantities (energy-momentum-stress 
tensor, charge-current density, electromagnetic field strength). From this point of view par- 
ticles are constructs obtained by first considering the field quantities as non- vanishing only 
within certain world-tubes, and then passing by limiting processes to the idealisation in 
which these world tubes are shrunk into world-lines. 

This was the philosophy adopted by Mathisson, who was writing at about the 
same time as Robertson but with much greater ambition. Mathisson wished not only 
to take the point particle limit but also to be able to treat the body as extended and to 
investigate the effects of its extended size on its motion. This approach raises its own 
questions, in particular what point to take as describing the position of an extended 
body and how to describe its structure. Mathisson addressed the second question first, 
by choosing initially an arbitrary world line within the body as describing its position 
and motion and then replacing the energy-momentum tensor with an equivalent 
infinite set of tensor fields defined along this world line which were considered to 
be multipole moments of the energy-momentum tensor. The covariant conservation 
equation satisfied by the energy-momentum tensor must then translate into a set 
of equations governing the form and evolution of the multipole moments. The first 
question, the choice of representative world line, is then answered by selecting one 
that simplifies these results in some way. 

This is the ambitious plan of Mathisson’s “New Mechanics”. Despite its title, his 
paper of 1937 was only a first step towards this goal. He continued to work on this 
problem until his untimely death in 1940. This work was later taken up by others 
and the way his plan was eventually taken to completion is the subject of this paper. 
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2 Myron Mathisson, the Man 



First a little of his history. See Sauer and Trautman [6], on which this account is 
based, for more details. 

Myron Mathisson was a Polish Jew, born in Warsaw on 14 December 1897. He 
studied Civil Engineering at the Politechnika Warszawska from 1915. In 1919 he 
was drafted into the Polish Army, which was then fighting the Soviets. After that war 
he entered Warsaw University in 1920 to study physics. In 1925 he wrote a paper 
entitled On the motion of a rotating body in a gravitational field. This was never 
published, but the professor of theoretical physics at the University offered to accept 
it as a Ph.D. thesis. Mathisson was not satisfied with his own work and declined 
the offer. 

In 1927, Einstein published the two papers described in the Introduction. As was 
seen there, Mathisson had a very different philosophy about the problem of motion 
in general relativity than did Einstein. This led him, on 18 December 1929 with no 
Ph.D. and no publications, to write the first of what eventually turned out to be around 
20 letters to Einstein. The start of this letter is shown in Eig. 1 . He wrote that Einstein 
neglects radiation and deviations from spherical symmetry, these approximations 
being due to the “mathematical insufficiency of your method”. The letter continues 
for 1 1 pages. 

At the time that he received this letter, Einstein’s collaboration with Grommer had 
come to an end and he was looking for someone else. He responded to Mathisson’ s 
audacious letter not by throwing it away but by inviting Mathisson to Berlin as a 
collaborator. Mathisson declined the invitation on the grounds that he did not yet 
feel ready for such a step. 



£17 n ^ z e ar ^ 



Fig. 1 The beginning of Mathisson’s first letter to Einstein, 18 December 1929 (Albert Einstein 
Archives at the Hebrew University of Jerusalem, 18-001) 
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In 1930, Einstein attempted unsuccessfully to obtain a stipend from the Rockefeller 
Foundation for Mathisson. Between 1932 and 1936, Mathisson lectured at Warsaw 
University but without a permanent position. In 1936 he left Warsaw for a position 
in Kazan in Soviet Russia. While there, he received a letter from Einstein inviting 
him to come for a year to the Princeton Institute for Advanced Study, where Einstein 
himself was at that time. Mathisson replied that he could come for the academic year 
1937-1938 after a year in Russia. He added that he had found good conditions to 
work in Russia and that he would return to Kazan after his time in Princeton. 

Einstein replied, expressing delight that Mathisson had found good conditions to 
work in Soviet Russia but also withdrawing his invitation to Princeton as it would 
not be right while there were so many scientists deprived of the possibility of work. 
Einstein then began a collaboration with Leopold Infeld and Banesh Hoffman and 
wrote to Mathisson in Kazan on 7 May 1937 that they have developed a new method 
(later known as the EIH method) of deriving the equations of motion for point masses. 
Infeld received the invitation to Princeton in place of Mathisson. The EIH paper [7] 
was submitted to Annals of Mathematics on 16 June 1937. Infeld arrived in Princeton 
in October 1937, the date when Mathisson had offered to arrive. 

Mathisson returned to Warsaw from Kazan at the end of May 1937 as the situation 
for a foreigner there had become unbearable. On 5 September 1937 he submitted 
his most important paper [1] to Acta Physica Polonica. One can only wonder how 
different the development of equations of motion in general relativity might have 
been had Mathisson’s visit to Princeton taken place and led to an Einstein-Mathisson 
collaboration instead of the Einstein-Infeld one. 

In 1938 Mathisson went to Cracow where he collaborated with Adam Bielecki 
and Jan Weyssenhoff on unresolved issues of his 1937 paper, the results of which 
were published in [8] . While in Cracow he married Irena Jungermann. In the spring of 
1939 the Mathissons went to Paris and later that year to Cambridge where he worked 
with Dirac. There he published a further paper [9] on his approach to equations of 
motion, this time considering a body under the influence of external forces within the 
framework of special relativity. Suffering from tuberculosis, he died in Cambridge 
on 13 September 1940. His work had so impressed Dirac that he edited Mathisson’s 
unpublished notes into a posthumous paper [10]. 

In his historical account of work on the problem of motion, Havas [11] is very 
critical of the work on the problem of motion done by Einstein and, especially, 
Infeld and writes (p. 267) that “Mathisson’s contributions were far more original 
than Infeld ’s and introduced far better mathematical methods”. 



3 Mathisson’s Paper of 1937 in Outline 

The central concept introduced in Mathisson’s paper of 1937 is what he termed the 
gravitational skeleton of a body. This describes a body occupying a world tube of 
finite spatial extent and is defined relative to a timelike world line L within this world 
tube. It provides an alternative description of an extended body to that provided by 
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the energy-momentum tensor .li consists of an infinite set of multipole moments 

symmetric separately on their last two indices a, [3 and on 
all the remaining indices, such that 

( 1 ) 

for symmetric tensor fields ^a(3 of compact support. This is effectively treating the 
energy-momentum tensor as a distribution (generalised function) before the concept 
had been invented. The integral on the left extends over all spacetime, with Dx = 
being the spacetime volume element. That on the right is over the world line 
L on which 5' is proper time, Vq, denotes covariant differentiation and Va(5 = 

Note that this definition in fact differs slightly from that of Mathisson, in that he does 
not include the l/n\ factor in the 2^^-pole moment term. These factors are included 
here for agreement with later work of other authors. 

In what follows, the signature of the spacetime metric is taken as —2, so that 
the unit tangent vector to L satisfies VaV^ = 1. Round brackets around indices 
denote symmetrisation, square brackets denote antisymmetrisation. The sign of the 
curvature tensor is such that for a covariant vector field the Ricci identity takes 
the form 



= 2^'^a/37^5- (2) 

The energy-momentum tensor satisfies the covariant conservation equation 

= 0. (3) 

There is an equivalent property of the gravitational skeleton, obtained by taking 
in (1) to have the form 



^ajS — (4) 

The left hand side of (1) then vanishes as a consequence of (3), so the moments must 
satisfy 



^ + ■ ■ ■ ^ d5 = 0 (5) 

for of this form. Mathisson called this the variational equation of dynamics, the 
variation in question being the ability to vary at will. All the dynamics of the 
body is contained implicitly in this equation and his “new mechanics” consists of 
extracting its consequences as explicit restrictions on the form and evolution of the 
moments. 
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Mathisson continued by considering successively a body small enough that only 
the first one, two or three moments need be taken into account. For the case of a 
point mass, when only the monopole moment is retained, it follows from (5) that 
hsLYQ the form 



( 6 ) 

for some scalar function M(s), where is the unit tangent vector to L, and further- 
more that 



— (Mu“) = = 0. (7) 

as 

This in turn implies that M is independent of 5' and hv^/ds = 0, so L must be a 
geodesic. 

When the dipole moment is also retained, he noted that (1) does not determine 
the moment tensors and uniquely. The transformation 



m ^ ^ m ^ H , 

d^ 


(8) 




(9) 



leaves the right hand side of (1) invariant for arbitrary symmetric The additional 

terms combine to give a total derivative that integrates to zero, since has compact 
support. He used this freedom to impose the additional condition 

= 0 ( 10 ) 

and then deduced from (5) that must have the form 

( 11 ) 

where is antisymmetric and both it and are orthogonal to on all indices. 
He later identifies as the angular momentum (spin) tensor of the body. 

Next he chooses coordinates such that = s, = 0 on L for / = 1, 2, 3 
and such that the hypersurfaces of constant y^ meet L orthogonally. If (1) is written 
explicitly in these coordinates and ipap is expanded about y^ = 0 on each hypersur- 
face of constant y^, the coefficients of d(poo/dy^ on both sides may be equated to 
give 

n‘ = j (12) 

with the interpretation that is the static dipole moment of the mass density p = 
Its value will therefore depend on the choice of the world line L, enabling L to 
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be chosen as the world line of a centre of mass such that = 0. Since = 0 it 
follows also that = 0 so that for all components 

= 0, (13) 



with which (11) simplifies to 



(14) 



With L determined by (13), Mathisson shows successively that (5) requires 5'“^ 
to satisfy 



g^a/3 55«7 

h V = 0, 

d^ ^ d^ ^ ds 



(15) 



that must have the form 






8 5 / 1)7 



(16) 



for some scalar function M of s and that 












(17) 



The equations of motion in this approximation are (15) and (17) which govern the 
evolution of variables M, and . Since is a unit vector and is antisym- 
metric, these have respectively one, three and six linearly independent components. 
There appear at first to be ten equations for these ten variables but the left hand side 
of (15) vanishes identically when multiplied by reducing the number of equations 
from ten to seven. 

For their solution to be determinate they must be supplemented by 

= 0 (18) 

which forms part of the definition of 5'*^^ through (11). This reduces the number of 
variables also to seven. When (18) holds, multiplying (15) by Va shows that M is 
constant as in the monopole case, reducing the set to six equations for six variables. 

A condition added to (15) and (17) to make them determinate has become known 
as a supplementary condition, although the term was not used by Mathisson. His 
condition results from his imposition of (13) in the derivation of the equations. Other 
authors have applied different conditions, as will be seen later. 

Finally Mathisson considers the case where the quadrupole moment is also 
retained. He uses the non-uniqueness of the moments to impose, as an analogue 
of (10), the orthogonality condition 
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= 0 (19) 

and then decomposes the moment tensor into parts parallel to and orthogonal to as 

= h'jSaf} ^ ^ ( 2 O) 

The tensors b ' are orthogonal to on all indices, are symmetric on their first two 
indices and is also on its last two indices. He shows that (5) implies that 

jyilSa) ^ ^(75a)/3 ^ q ^2\) 

but deduces incorrectly that the second of these implies, when taken together with the 
symmetry of on 7, 5 and a, f3, that this tensor is identically zero. In fact these 
symmetry and orthogonality properties leave b^^^^ with six linearly independent 
components, b'^^^ with eight and b'^^ with six, a total of 20 for the quadrupole 
moment overall. The significance of this count will become clear in Sect. 11. 

He continues by neglecting the b'^^^ contribution as he wants to retain only those 
quadrupole elements with classical counterparts. It follows that he would also have 
neglected the b^^^^ contribution had he not incorrectly believed it to be zero. In 
the coordinate system and notation of (12) the remaining contribution b^^ is zero if 
either index is 0 and the purely spacelike components are given by 

b'} = J ( 22 ) 

Due to the quadrupole presence, expression (1 1) for the dipole tensor gains 
additional terms that involve b^^, as does Eq. (15) governing the spin tensor 
which then becomes 

Note that the result in Mathisson’s paper has an additional factor 2 on the right-hand 
side. Both forms are correct, the difference being due to the presence in the quadrupole 
term of (1) of the 1/2! factor not present in Mathisson’s paper, as discussed above in 
connection with (1). 

The right hand side of (23) represents the torque exerted on the body by the 
gravitational field due to its quadrupole moment. He does not derive the modified 
form of Eq. (17) which would also be expected to gain additional terms. 

4 Issues Arising from the Paper of 1937 



There are several major issues left unresolved in this paper of 1937. It is useful to 
identify and study these as they provide a context in which to evaluate later work on 
the subject. 
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The first issue concerns the existence and uniqueness of the moment tensors. In 
that paper their existence was dealt with only briefly. Indeed, he arrived at (5) by 
a different route than that presented here, without ever writing (1) explicitly. The 
method presented here appears only in a footnote to his Eq. (2.31), where he says 
that it would be the shortest way to the result (5). He recognised their non-uniqueness 
but used it only to impose the orthogonality conditions (10) and (19) in the cases he 
studied. 

This issue was the subject of his paper [8] of 1939 jointly with Bielecki and 
Weyssenhoff. There the definition (1) was given explicitly. The aim of the paper was 
to show that the moment tensors exist and that they become uniquely defined if in 
addition they are required to be orthogonal to on all indices except the last two. 
This requirement includes (10) and (19) and is their natural generalisation to higher 
order moments. 

The existence proof merits consideration because it too leaves an important issue 
unresolved, one that is crucial in the subsequent completion of his program. It begins 
by introducing a special coordinate system in which = s, = 0 on L for 
i = 1,2,3 such that the hypersurfaces E (^') given by =5' intersect L orthogonally. 

These are the same conditions used in the derivation of (12). It then defines quantities 
which are not tensors, whose components are zero if any 
index except the last two is 0 and whose other components are given by 

= [ y (24) 

J'E(s) 

where there are any number of indices i ... j and corresponding factors y\ . .y-^ , 
including none. 

Expansion of ipa(3 on T>(s) in a Taylor series about = 0 then gives 

J T ^ (fajS ~ /.( ^ H ^ ds. 

(25) 

An ^-fold partial derivative can be expressed in terms of covariant derivatives of all 
orders from n downward, together with terms involving components of the affine 
connection T^^. By a regrouping of terms, (25) can therefore be written in the form 
(1) where the m’s of any order are expressed in terms of the M’s of that and all higher 
orders and of the components of the affine connection and their partial derivatives. 
Transformations such as (8) with (9) can then be used to impose orthogonality to 

on all indices of the m’s except the last two. Repeated use of the Ricci identity 
enables all except the totally symmetric part of repeated covariant derivatives to be 
eliminated, so that without loss of generality the m’s can be taken as symmetric on 
all indices except the last two. By construction they are already symmetric separately 
on the last two. These steps all involve further regrouping of terms but they leave the 
overall form of the result (1) unchanged. 
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This completes the existence proof. The paper gives explicit expressions for the 
m’s up to the quadrupole moment in terms of the M’s up to this same order, 
and its partial derivatives. The uniqueness proof is straightforward and will not be 
described here. 

The issues arising from the existence proof concern the various infinite series 
involved. The paper makes explicit at the outset that it assumes the convergence of 
the series that arise and the analyticity of functions so far as is required. The field 
in (1) may be chosen freely, so there is no problem with the non-covariant Eq. (25) 
as it can be chosen to be analytic on the hypersurfaces T>(s). There is, however, a 
problem with its manipulation into the form (1). 

To see this, note that the m’s at z(s) e L are determined by the M’s with this 
value of 5' and so in turn by the value of in the neighbourhood of Ti(s) together 
with affine connection terms. In the neighbourhood of rather than on as derivatives 
with respect to 5’ will arise from the transformations such as (8) with (9). But T>(s) 
is not uniquely defined. It is any spacelike hypersurface through z(s) that meets L 
orthogonally. If the series for the m’s in terms of the M’s converged, the uniqueness 
of the m’s would imply that these series would sum to the same value for all such 
choices of E(5'). 

If were also required to be analytic then this would be possible, as its value in 
the neighbourhood of one such hypersurface would determine its value throughout 
the domain in which it was analytic, and so in particular also on all other choices 
of S(5'). But is a physical field. It may not even be smooth, it is certainly not 
analytic. Its value in the neighbourhood of one such hypersurface does not determine 
its value on any other, except where they meet. So in general the series for the m’s 
in terms of the M’s cannot converge and the existence proof falls. 

This may seem like unnecessary pedantry, criticism that would be appropriate for 
a piece of pure mathematics but out of place and of no significance in mathematics 
applied to the physical world. It is, however, of crucial importance. Mathisson’s 
program is based on (1) being an implicit definition of the multipole moments. This 
definition leads to (5), a single equation that embodies all the consequences for the 
moments of the conservation equation (3) for the energy-momentum tensor. It needs 
to be possible both to extract explicit evolution equations for the moments from 
(5) and explicit expressions for the moments in terms of from (1), for without 
the latter the meaning of the moments governed by the former is unknown. The 
implicit analyticity of the energy-momentum tensor must be circumvented before 
Mathisson’s program can be completed. 

The second issue arising from the 1937 paper is the one that arose in his treatment 
of the quadrupole. In the monopole and dipole cases no further approximations 
were needed beyond the neglect of higher order moments. In the quadrupole case 
this was not so. Only certain components of the quadrupole moment were retained, 
those with a classical analogue. This was not because the other components lacked 
interest, or at least not only for this reason. Had all components been retained, 
it would have added an unmanageable level of complexity. Progression from the 
monopole to the dipole case added a second evolution equation, with both equations 
then involving time derivatives of the dipole moment, i.e., of the spin. In the same 
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way, progression from the dipole to the full quadrupole case would have added a third 
evolution equation and all three would have involved time derivatives of quadrupole 
components. Each successive approximation would add a further equation in this 
way, and a complete solution of the variational equation would have an infinite set 
of equations, each involving time derivatives of an infinite set of moments. 

In Newtonian mechanics there would only be two evolution equations, those 
for momentum and angular momentum. Mathisson’ s new mechanics should aim at 
maintaining this simplicity. This would require eliminations between the full set of 
equations to leave only two, those governing momentum and angular momentum. 
The issue is whether or not this can be achieved, and if so how it can be done. 

The final issue concerns the supplementary condition (18). Equation (17) is of 
second order in . By use of (18) it can be put in a form that is of first order in 
but then it becomes second order in The set of equations (15), (17) and (18) 
then has solutions corresponding to arbitrary initial values of position, velocity, spin 
and acceleration. This is greater freedom than classical mechanics allows. 

If the equations of motion are considered to represent a point particle then it 
is a matter of fact whether (18) is or is not satisfied. In a second paper [12] in 
1937, Mathisson showed that this additional freedom in the solutions provides a 
classical analogue of the zitterhewegung of the quantum theory of the electron. But 
if the equations of motion are to represent an extended body then the supplementary 
condition selects the world line used to represent its position. It is no longer a matter of 
fact, but of choice, whether (18) holds. The freedom to specify an initial acceleration 
is now a defect. It does not correspond to a physical freedom but merely to the 
failure of (18) to determine a unique world line. It points to the need for a different 
supplementary condition that does not admit such solutions. 



5 The Unphysical Solutions 

The unphysical solutions allowed by the supplementary condition (18) are present 
also in special relativity. In this context they were studied by a number of authors in 
the period 1939-1950. It was noted above that they must be avoided when modelling 
an extended body. To achieve this, their origin needs to be properly understood. The 
following account is based on the work of Pryce [13] and Mpller [14] and uses a 
Minkowskian coordinate system in the fiat spacetime of special relativity. 

Consider an extended body with symmetric energy-momentum tensor that 
satisfies the conservation equation (3). Given a spacelike hypersurface E that spans 
the body and a point z not necessarily on E, let 




( 26 ) 
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and 

S»P = 2 J (xt“ - (27) 



The difference between their values for two different choices of E can be converted 
by Gauss’s theorem to volume integrals. The integrands then vanish by the conser- 
vation equation for and because it is symmetric. These quantities are therefore 
independent of the choice of E . As the definition of does not involve z, it follows 
that is constant while becomes a function only of the point z. These proper- 
ties have led to them becoming the generally accepted definitions of momentum and 
angular momentum (spin) for an isolated body in special relativity. Generalisations 
to more complex situations such as general relativity or the presence of external 
forces need to reduce to these in this special case. 

Let z move along an arbitrary timelike world line L parameterised as z(s) by 
proper time 5’ and let = dz^/ds, which satisfies v^Va = 1. Differentiate (27) 
with respect to 5', holding E fixed as the values of the integrals do not depend on it. 
This gives 

(28) 



from which 

= Mv°^ + vs 

d^- 

where M = p^Va- Since p^ is constant, (29) gives 



d 

d^ 



+ 




(29) 



(30) 



while (28) and (29) together give 



d^- 



+ 



dS^^ 

ds 






d5«7 

ds 



= 0 . 



(31) 



These are precisely the forms taken in fiat spacetime by Mathisson’s equations 
(15) and (17) in his dipole approximation, but now they are exact and the supple- 
mentary condition (18) has not yet been imposed. The variables also agree, to the 
accuracy of his approximation. To see this, note first that in the nota- 

tion of (24) when quadrupole and higher contributions are neglected, and that the 
definition (27) of corresponds to to this accuracy. But (14) 

gives = 5^^, so it follows that the two tensors are in agreement. The 

vector is the unit tangent to L in both cases and in order for the right hand side 
of (29) to be constant with both definitions of M, these must also agree. 

This agreement enables the origin of the unphysical solutions arising from the 
supplementary condition (18) to be studied in a context where the equations of 
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motion are exact. First suppose that z is any point on some L that satisfies (18). 
Choose E in (27) to be the hyperplane — z^) = 0 through z. For this choice, 

dTia is parallel to Va so that cIEq, = Hence (18) with (27) gives 



z"(^) 



/ 



x‘^T>^'>Visv^v^d'Ls 



T^^VfjV^v^d'Ls, 



(32) 



showing that is a centre of mass for the mass (or energy) density in 

the instantaneous rest frame. It follows that for a body with positive energy density 
throughout, i.e., T^^VaVf] > 0 for all choices of timelike L lies within the body 
(suitably interpreted if spacelike cross-sections of the body are not convex). 

This is a restriction on the solutions for L but it does not make L unique. To study 
the non-uniqueness in more detail, let 



K»i 3 ^2 j (33) 

SO that (27) gives 

5"^ (5) = - 2z^°‘(s)p^\ (34) 



This shows that there is a unique world line parameterised by a scalar A, given by 



= Ap“ + 



P-Ipy 



(35) 



on which 

= 0. (36) 

If the Minkowskian coordinate system is chosen so that this line is the time axis 
then it is also given by so (34) gives S^^{s) = and (36) becomes 

= 0. (37) 

Let Latin indices take values 1,2,3 and let m be the magnitude of , so p a- 

Then with these coordinates p^ = 0, = m and hence from (37) also = 0. 

From (34) the supplementary condition (18) can be put in the form 



Vfi - 2z^°‘{s)pf^'^ = 0. (38) 

This has only three linearly independent components, which can be taken as those 
with ct = 1, 2, 3, since multiplication by gives an identity. It is therefore equiva- 
lent to 



VjK^^ — mz^VQ = 0 



(39) 
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in the chosen Minkowskian coordinates. Since is antisymmetric it can be rep- 
resented by a three-dimensional vector Ki such that Kk where 

is the three-dimensional Levi-Civita symbol. If t = is coordinate time then 
/v^ = dz^ /dt. These results enable (39) to be expressed in three-dimensional 
vector notation as 



dz 

mz X K = 0. 

dt 



(40) 



Since K is constant, the time derivative of (40) shows that dz/dt and K are 
orthogonal. The vector product of (40) with K therefore gives 



dz 

dt 



m 

— ^ K X z. 



(41) 



This corresponds to motion about the origin in a circle with angular velocity m/^, 
where K = |K|, in a plane perpendicular to K. From (40) the radius of the circle 
is vKjm where v — \dz/dt\. Units here have the velocity of light c = 1 so since 
L is timelike, i; < 1. The possible circular motions therefore all lie in a disc of 
radius KImm this plane and centred on the origin, but all such motions give an L 
that satisfies the supplementary condition, so confirming the non-uniqueness claimed 
above. For a body with positive energy density throughout, it was seen above that all 
choices of L that satisfy the supplementary condition lie within the body, suitably 
interpreted. It follows that this disc lies within the body, which gives a minimum size 
for such a body when its spin is nonzero. 

A different perspective arises if the Eqs. (30) and (31) are taken in isolation, so 
that the quantities M and are considered to be properties of the body rather 
than constructs from the energy-momentum tensor. This is the situation when the 
equations of motion are used to model a point particle whose motion satisfies (18) as 
a matter of physical fact. The same results hold but they need to be cast in terms of 
M and rather than m and K. With the same choice of Minkowskian coordinate 
system used above, (34) gives but whereas = 0, is not zero. 

From (18), 

= -S'^vj/v^ = -K‘jvj/vo = - (42) 

dt 



But dz/dt and K are orthogonal three-dimensional vectors and 5/o = due to 
the form of the metric tensor, so 



S‘^Sio = -{vKf. (43) 

With the magnitude S of the spin tensor 5'“^ defined by = S‘^^Sa/3/2, (43) gives 

= K^{1 - v^). 



(44) 
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Also 



M = p^Va 



■ mvQ 



■ m 



/\/ 1 — 



(45) 



so 



vK _ vS 
m M(l — v^) 



(46) 



which gives the radius of the circular orbit in terms of S and M as required. But 
whereas i; < 1, which gave a limiting size to the radius, v/{l — v^) is unbounded as 
i; ^ 1. With this perspective there is no limit on the size of the circular orbits. As 
described above, the initial acceleration of the particle can be arbitrary and it is this 
choice that determines which orbit is followed. 

This form of solution to the Eqs. (30) and (31) in flat spacetime was first given 
by Weyssenhoff and Raabe [15]. Mathisson [12] had earlier given a solution in the 
non-relativistic approximation that he had used to provide a classical analogue of 
the zitterhewegung of the electron. But as discussed in the preceding section, when 
the equations are used to model an extended body then these unphysical solutions 
must be avoided. The above analysis provides the clue to their avoidance, for it was 
seen that (36) does determine a unique world line. So in special relativity, if (36) is 
adopted as the supplementary condition in place of (18) then unphysical solutions 
are avoided. The issue in a curved spacetime is to find appropriate meanings for 
and This will be addressed in Sect. 21. 



6 The Period 1950-1970 



At his death in 1940, Mathisson [9] had been studying his variational equation in 
special relativity for a body under the influence of external forces, maybe looking 
for clues as to how to progress further with it in general relativity. This work led to 
a paper by Shanmugadhasan [16] in 1946 that continued this development in special 
relativity. The multipole approach to equations of motion in general relativity went 
dormant, however, until the appearance of a paper by Papapetrou [17] in 1951 entitled 
“Spinning test-particles in general relativity. I”, a title that does not show anything 
like the same vision as did Mathisson’ s. 

Papapetrou defined a set of non-covariant moment integrals identical to those 
of Bielecki et al. [8] given as (24) above. He continued by developing equations 
of motion in terms of these quantities and other similarly defined ones, also non- 
covariant, constructing combinations of them along the way that he subsequently 
proved did transform as tensors. This was done first for a particle in which only the 
monopole moment is retained, and then when the dipole moment is also retained, 
which he calls a pole-dipole particle. In this way he finally arrives at the Eqs. (15) 
and (17) first obtained by Mathisson. Mathisson’s paper [1] of 1937 is given as a 
reference, but with the statement that it gives a discussion of the pole-dipole particle 
in special relativity. Since Papapetrou’s paper of 1951 in Proceedings of the Royal 
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Society is far more accessible than Mathisson’s in a pre-war issue of Acta Physica 
Polonica, no-one seemed to check on this and for many years the equations of motion 
came to be associated with Papapetrou rather than Mathisson. 

Papapetrou’s work made one advance over Mathisson’s results, in that (15) and 
(17) are derived without imposing a supplementary condition. This enables the mer- 
its of any such condition to be considered as a separate issue. Papapetrou was aware 
of the circular motions in special relativity that are possible with the supplementary 
condition = 0, describing them as internal motions for a pole-dipole particle. 

He asserted that these give a most successful classical analogue of the zitterhewe- 
gung of the electron, giving a paper of 1940 by Honl and Papapetrou [18] but not 
Mathisson’s second paper [12] of 1937 on this topic as a reference. He noted that 
for a macroscopic particle, however, these internal motions must be avoided. In a 
joint paper [19] with Corinaldesi submitted together with [17] they studied motion 
in a Schwarzschild field using a supplementary condition = 0 where is 

the timelike Killing vector of the static spacetime. This has the merit of avoiding the 
internal motions but it is not applicable to non-static spacetimes. 

In 1959, Tulczyjew [20] gave another derivation of (15) and (17). He noted that 
these equations were first derived by Mathisson and, later, “by a far less perfect 
method” by Papapetrou. His method simplified Mathisson’s approach by the use 
of a singular energy-momentum tensor constructed from Dirac delta- functions. The 
main advance made by this paper, however, concerned the supplementary condition. 
Tulczyjew noted from the work of Weyssenhoff and Raabe [15] that in connection 
with the Eqs. (28)-(31) in special relativity, = 0 does not determine a unique 

world line in special relativity but that = 0 does so. He therefore adopted 

= 0 also in general relativity, setting 



o cap 

p^ = Mv^ -\-vg- (47) 

d^- 

in his words “owing to the lack of another definition”. This was the first use of a 
supplementary condition in general relativity that determined a unique world line 
without needing some variables external to the body concerned. This condition is 
not applicable to an extended body, however, as the variables it involves are defined 
only in the context of a singular energy-momentum tensor. 

A paper by Tulczyjew and Tulczyjew [21] in 1962 gave yet another derivation of 
(15) and (17), this time by an improved version of Papapetrou’s approach. The non- 
covariant moment integrals of Papapetrou were replaced by integrals made covariant 
by being defined in a special coordinate system based on the world line L chosen to 
represent the body. Consequently they do have well defined meaning for an extended 
body. It is noted that the equations need a supplementary condition of the form 
ppS^^ = 0 for a suitably defined momentum vector p^ but this vector is not one of 
the well defined moment integrals. It is defined only in the context of the pole-dipole 
approximation, so the condition is still not applicable to an extended body. 

The present author had a go at this [22] in 1964. This was again a version of 
Papapetrou’s approach, now with moment integrals made covariant by the use of 
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two-point tensors. The same equations were derived in the pole-dipole approximation 
and once again the main advance concerned the supplementary condition. This time 
the equations were derived in the form 



and 



d^ 






a 

■/S-fS 



(48) 



d^- 






(49) 



where and 5“^ were among the quantities defined as covariant integrals of the 
energy-momentum tensor. 

These equations have a well defined momentum vector but are nevertheless equiv- 
alent to (15) and (17) with M = p^Va- To see this, multiply (49) by and use 
v^vp = I to obtain an expression for p^. Use of this to eliminate from (48) and 
(49) gives (15) and (17) as required. 

At last the condition 

Pl3S^^ = 0 (50) 



becomes well defined for an extended body. It cannot yet be regarded as satisfactory, 
however, as the integral expressions are somewhat ad hoc. They could be modified in 
ways significant only in a quadrupole or higher approximation without changing the 
pole-dipole approximation but still affecting the world line that would be determined 
for an extended body. 

Attempts to extend (15) and (17) to include the quadrupole moment were made by 
Taub [23] in 1965 and Madore [24] in 1969. Neither seemed aware that Mathisson had 
already considered the quadrupole case in 1937. As with Mathisson, both attempts 
needed further approximations beyond those implied by truncation of the multipole 
series. They needed this for the same reason as did Mathisson. As discussed in Sect. 4, 
retaining all quadrupole components leads to a third evolution equation and the need 
for a potentially unmanageable task of elimination between all three. Both Taub and 
Madore obtained quadrupole corrections to both the momentum and spin equations 
while Mathisson had only obtained that for spin. Both attempts were flawed, however, 
by algebraic errors that invalidated the results as published. Corrected versions of 
their results were given by the present author in [25] where they were seen to agree 
both with one another and with the results of that paper. The corrected versions also 
agree with Mathisson for the one equation that he obtained. 

In the opinion of the present author, these advances over twenty years were only 
tiny steps. They had shed light on the problem of defining a unique centre of mass and 
made simplifications to the handling of the pole-dipole approximation but attempts 
to retain moments beyond the dipole had met a barrier that resisted all attacks. There 
seemed to be a locked door beyond which one could not pass. 

I believe that Mathisson had realised that he had reached a door preventing his 
further progress. At his death, Mathisson had been studying his variational equation 
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in special relativity. This appeared to be a step backwards. Perhaps it was this that 
had led Papapetrou to believe that Mathisson’s earlier work was also only in special 
relativity. It seems more likely to me that Mathis son had realised this would be a 
simpler arena in which to find the key to that door, but his untimely death prevented 
him from finding it. 

In 1967 [26] I took up the search for the key in this special relativity arena 
and found it, or rather the two keys that are needed to progress beyond this door. 
Armed with these keys, the path to higher order multipole approximations in general 
relativity became accessible but there were still obstacles to be overcome. This further 
development occupied me until 1974 and was published in [25, 27, 28] . Those papers 
deal with the presence of both gravitational and electromagnetic fields. The following 
sections deal only with the purely gravitational case, for which they reveal the keys 
and the route past the obstacles and take Mathisson’s program to completion. 



7 Fourier Transforms Avoid Need for Analyticity 

It was seen in Sect. 4 that Mathisson’s moment-defining equation (1) does not lead 
to unique expressions for the moments as integrals of and that the reason for 
this is its assumption that is analytic. The first key provides the means to avoid 
this assumption. It is the use of Fourier transformation. 

Mathisson’s defining equation (1) for the moments can be rewritten in full as 



Here L is again parameterised as but now for generality 5* is not necessarily 

proper time. Here and throughout, n is the number of indices in the set marked with 
dots, in this case J • • • 7, including the possibilities ^ = 0 and n = 1. 

Consider first the situation in fiat spacetime, with a metric tensor whose compo- 
nents are constant but not necessarily diagonal. Define the Fourier transform of 
by 



where k • x = k^x^. This is well defined since and k^ can be considered as 
position vectors when the metric tensor has constant components. If in (51) is 
written in terms of pap by the Fourier inversion formula, it gives 




n>0 




(52) 







n>0 



(s)(pai 3 ik) exp(-i k ■ z{s)). 



(53) 
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This is completely equivalent to (5 1) and holds no advantage over it. The key step 
is to exchange the order of the summation and the k-space integration, to give 

= (54) 

where 

J^dsJ maisizis), k) (55) 

with ^ 

k) = ^ ks • • • (s) (56) 

nl 



and 

^ap(z, k) = ^a^(k) exp(-ik • z). (57) 

In general this exchange is not valid, but the Eq. (54) that results requires no assump- 
tion about the analyticity of This is an improvement over (51) and so (54) will 
be taken, in flat spacetime, as the definition for the moments in its place. 

What is now needed is a similar change of definition in a curved spacetime. The 
left hand side of (54) is well defined as an integral over the spacetime manifold M. 
The ultimate aim is to define the m’s at z(^') to be tensors at this point. For (56) to 
be a covariant equation, this requires ka to be a vector at z(^’). For fixed 5’ this would 
make k) be a tensor at z(s) that is a function of a vector k^ at this point. 

The right hand side of (54), as given by (55), then becomes well defined if 
^a^(z(s), k) is also such a field. The set of all vectors at a point z forms the tangent 
space (M) to M at z so these are functions that map the tangent space to tensors at 
z. The tangent space is a vector space that can be considered as a fiat manifold with 
a natural metric tensor given by the metric tensor of M at z. From this viewpoint 
tensor fields on T^iM). In (55), Dk is the volume element on 
this fiat manifold. Fourier transforms on the tangent space are defined by (52), with 
X e T^{M) replacing x e M, just as for any fiat manifold. So ^a^(z(s), k) can be 
taken as the Fourier transform of a tensor field ^a^(z(s), X) on the same tangent 
space. In the fiat spacetime considered above, the ^a^(z, k) defined by (57) is given 
in this way when 



^a( 3 {z. X) = Pa(3{^) with X = V - z. (58) 

All that remains to make (54) be well defined in a curved spacetime is therefore to 
generalise (58) appropriately. 

This next paragraph uses terminology from differential geometry to define the 
mapping needed to specify this generalisation. It puts the mapping in its proper 
context, but it is not necessary for understanding the remainder of this paper. Where 
details of the mapping are needed, the necessary properties will be developed before 
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use. Most of that development is in Sect. 14, but here for the record is the specification 
from differential geometry. When M is fiat there is a natural identification of (M) 
with M, which is why (58) is meaningful in special relativity. In a curved spacetime 
the two are related by the exponential map Exp^ : T^{M) M, for which see 
Bishop and Crittenden [29] or many other standard texts on differential geometry. If 
X G T^{M) and x = Exp^X then the derivative map ((Exp^)=,=)x of Exp^ at X maps 
Tx(Tz{M)) isomorphically onto 7i(M). This mapping between tangent spaces has 
a unique extension to a mapping of the corresponding tensor algebras. This will be 
denoted by replacing the * by A (for Algebra) in the notation. By letting X vary, this 
gives a map (Exp^)^ from tensor fields on T^{M) to tensor fields on M. Given such 
a tensor field for each z, application of the corresponding maps produces a family of 
tensor fields on M parameterised by z, i.e., a two-point function with scalar character 
at z and tensor character at the second point x. Let Exp^ denote this overall map and 
Exp^ denote its inverse. 

What is needed at present is that there is a well defined linear mapping Exp^ that 
maps functions such as Oq,/ 3 (z, X), which is a tensor at z G M dependent on a vector 
X at z, into two-point functions such as (pa( 3 iz, x) which have tensor character at 
X and scalar character at z. Alternatively X) may be considered as mapping 

the tangent space T^(M) to tensors at z. The vector X and point x are related as 
follows. If y(w) is a geodesic on M with affine parameter u such that y = z and 
dy^/du = X^ when w = 0, then x = y(l). The map from T^(M) to M that sends 
X to X is known as the exponential map Exp^, so x = Exp^X. 

The inverse Exp^ of Exp^ acts in the opposite direction, so it acts on two-point 
functions such as Pa( 3 iz, x). It may therefore be applied to an ordinary tensor field on 
M by treating it as a two-point function that is independent of z. The interpretation 
of (54) in curved spacetime is completed by taking 



<t>a/3 = Exp^i^a/3 



(59) 



which has (58) as a special case. 



8 The Route to Moments as Explicit Integrals 

As with Mathisson’s original definitions, (54) only determines a unique set of 
moments if additional symmetry and orthogonality conditions are imposed. These 
will be taken as 



^7-fc/3(^) = torn > 0, (60) 



(s) = 0 for n > 1 



(61) 
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where n^{s) is a timelike unit vector at Recall that n is the number of indices in 
the set marked with dots, so these apply to moments with at least two or three indices 
respectively. These conditions differ from those of Bielecki et al. [8] in that need 
not be tangent to the world line L. This slight generalisation is for later convenience. 

It will now be shown that with these conditions, (54) does determine explicit 
integral expressions for the moments, and in particular that the hypersurfaces of 
integration are uniquely determined. Choose a Minkowskian coordinate system on 
r^(^) (M) such that 7 ^ 0 only for a = A. Then (56) and (61) showjhat k) is 

independent of . The inversion formula for the Fourier transform / (k) of a function 
/(v) of a single variable shows that 

J f{k)dk = 2nf(0). (62) 

With this, the k^ integration in the 5* -integrand on the right of equation (55) can be 
performed to show that its value for fixed 5’ depends on (z(s), X) only through its 

value on the hyperplane = 0, and hence on ipaf 3 only through its value on E(^’), 
the image of this hyperplane under Exp^ . This image is the hypersurface formed by 
all geodesics through z{s) orthogonal to 

Let r(x) be the scalar function such that r(x) = s if x e Ti(s) and let be any 
vector field such that = 1. Then 

T°‘^^Pap'Dx= [ ds [ di:^ (63) 



where is the vector- valued surface element on E(^'). The 5' -integrand here also 
depends on (paf3 only through its value on E(^'). Since ip a [3 is no longer required to 
be analytic, (54) still holds if (p^p is multiplied by an arbitrary smooth function that 
is constant on each The above dependency results show that this multiplier can 
be brought outside the k-space integral in (55) and similarly outside the hypersurface 
integral in (63). Since this multiplier is arbitrary it follows that the 5' -integrands must 
be equal, so that 

/ f Dk k)^ai3(z(s), k). (64) 

It is now possible to express pa (3 in terms of ^^(5 and so to identify k ) 

as an integral over T>(s). By expanding the resulting integrand as a series in ka, 
explicit expressions can be obtained for the moments as integrals of 

over T>(s). The details will not be given here as there are further modifications needed 
to the moment definitions before an exact solution to the variational equation can be 
obtained. The method, however, will be used with the final definitions in Sects. 12, 
23 and 24. 
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9 The Variational Equation Revisited 



As was seen in Sect. 3, Mathisson’s variational equation is obtained by taking 
in the moment-defining equation to have the special form 



^a (3 = (65) 

for an arbitrary vector field uoa of compact support. It follows from this that uoa 
satisfies 



0 

( 66 ) 

along all affinely parameterised geodesics x^(u), where = dv^/dw and curly 
brackets around three indices are defined by 

^{ 0 / 37 } — ^a(3j ~ ^(3ja "b (67) 

A result of this form is to be expected since if pa (3 = 0. is a Killing vector 
and (66) becomes the equation of geodesic deviation. This confirms the well-known 
fact that a Killing vector field satisfies the equation of geodesic deviation along any 
geodesic. 

Suppose now that pa (3 is known everywhere, together with the values of uOq^ and 
at a point z. These values for determine initial conditions for the integration 
of (66) along any geodesic through z, so the entire field can be determined by 

integration along all such geodesics. But the value of at z is given by (65), 

so the additional information needed about uoa is actually only the values 

Aa = u>a(z) and = V[„W/ 3 ](z). (68) 

Let be the unique vector field such that 

= Aa and = 8^(5 at z (69) 

and which satisfies the equation of geodesic deviation, i.e. (66) with the right hand 
side set to zero, along all geodesics through z. Note that 8^(3 is antisymmetric by its 
definition in (68). Put 

~ ^a- (70) 

Then Aq, satisfies the full form of (66) along all such geodesics and has 

Ac, = 0 and = Pap atz. (71) 

Note that A^ and are vectors at v that depend on both v and z. As such, they 
are functions to which Exp^ is applicable, so let 
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Aa = Exp'^Aa, Sa = and Ga!3 = Exp'^Po,^. (72) 



The map Exp^ : (M) ^ M under which X ^ x can be considered as giving a 

pointwise identification of the manifolds with one another. With this identification the 
components of g T^{M) become coordinates of x, forming a normal coordinate 
system on M with pole z. In these coordinates Aq, and Gaf3 become the components 
of Ac^ and With this interpretation, T^(M) is not fiat but is instead a Riemannian 
manifold with as its metric tensor. The coordinate expression 



— 



A/3 - rX«A^ 

d^Oi V a/3 7 



(73) 



for components of the covariant derivative maps under this correspondence to 



Exp^VaA/3 



dX^ 



■ “ ^a/3^7 



(74) 



where is the Christoffel connection on T^(M) corresponding to the metric Gap- 
In contrast to (73), the individual terms in (74) are tensors since the coordinate 
system given by the components of the position vector X^ is subject only to linear 
transformations. The notation 






d 

~d^ 



(75) 



will be used to emphasise the covariant nature of this partial differentiation on (M) 

while at the same time distinguishing it from the covariant derivative V a on M. With 
this notation the usual formula for the components of the Christoffel connection gives 



-1 

G^p Gaps where Gap^ = \^^{aG^p]. (76) 



-1 

Here G^^ is the matrix inverse of Gap, its contravariant form considered as a metric 
tensor, which needs to be distinguished from G^^ in which the indices have been 
raised with the fiat metric gap on It follows from (59), (65), (70) and (74) 

that 

^aP ^ap C 77 ) 

where 

Sa/3 =Exp^V(Q,^/3). (78) 

Since (74) assumes nothing about the form of Ac^, it may be applied also to ^a to give 

^aP — ^^{a^P) ^a/3 ^7 (7^) 



where Sq, is given by (72). 
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The left hand side of (77) is well defined for all symmetric whether or not it 
has the form (65), since the construction of Aq, by (66) with (71) does not depend on 
that special form. It is therefore possible to replace the moment-defining equation 
(54) by the modified form 

J Dx = (80) 

where the functional is still defined by (55). This redefinition serves the same 
purpose as the rearrangement of terms in infinite series in Sect. 4, but without the 
associated problems of convergence. 

This change only affects gravitational contributions to the moments since in fiat 
spacetime = 0 but it has an important effect on the variational equation, which 
by (77) takes the form 



[V*(„A^) + = 0. 



(81) 



Since is just partial differentiation by (75), Fourier transformation gives 

( 82 ) 

where F[- • • ] is used instead of a tilde for the Fourier transformation of a typograph- 
ically complex expression. Hence from (55) 

[V*(„A^)] = JdsJ Dkt^Aa (83) 



where 



t^(s, k) = k) = ^ k^ • • ’ (84) 

n>0 ^ 



with 

= 0 and = (n + for n > 0. (85) 

Once again n is the number of indices in a set such as (5 • • • 7 marked with dots, with 
values 0 and 1 allowed. The contribution (83) to the variational equation therefore 
involves only certain linear combinations of the components of the m’s, those given 
by (85). 

The other contribution involves the vector field cua only through the 

values Aa and of (68) along the world line L, as these are all the data required 
to construct from which in turn is constructed. Moreover for fixed z G L the 
dependence of E(^^(z, X) on and is linear. It follows that 
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( 86 ) 



for some Fa and antisymmetric Lap that are defined on the world line L by this 
equation and which are independent of the field uja- 



10 Tensor Decomposition Through Symmetry 

The tensors f " are formed from linear combinations of components of the m’s. 
This leads naturally to the question of what information remains in the m’s when 
that contained in the ^’s is removed. It seems likely that this residual information 
will remain unconstrained by the variational equation and will therefore provide the 
desired description of the freely specifiable part of the higher moments. 

The monopole and dipole moments are initially described by and m^^^, 
tensors with two and three indices respectively, but their final description in the 
equations of motion is by a and that have one fewer index. This seems to 
have led to a preconception among those who studied the quadrupole case that the 
free information in the quadrupole moment should be described by a three-index 
tensor. 

The second key to the locked door was the realisation that this is not the case, 
that there is a mathematical tool to answer the question of what remains when the 
t's are removed from the m’s. This tool is the theory of group representations, or 
more specifically of representations of the general linear group GL(n) that describes 
the transformation of tensor components under a change of coordinate system. This 
in turn is provided by the theory of representations of the symmetric group, i.e., 
the group of all permutations of a given order. This is the group that describes the 
symmetry properties of tensors. It is a powerful tool but once this is realised, what 
is needed can also be arrived at without its use. as will now be shown. 



Let 



f-il3a ^ ^ ^ ^(5-7/3«) + 2b for n > 0 (87) 



where a and b are constants. These satisfy the same symmetry relations 






( 88 ) 



as the m’s, while the Ls satisfy 






(89) 



It is easily seen that for any ^ > 0, there exist a and b such that 




( 90 ) 
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holds identically for all t ' satisfying (89). With these values, (85) and (87) show 
that the j " also satisfy 



j(S-ji3)a ^ Q forn> 0 . 



(91) 



The cases ^ = 0, 1 give 

_ 0 ^ y7/3o ^ = 0 (92) 

so that the m’s with two and three indices are equivalent to the corresponding t's and 
need no counterpart. 

These results show that a tensor m"' determines a corresponding j"' and t ' 
satisfying ( 88 ), (91) and (89) but it does not yet show that these two parts of m " are 
independent. To see this, suppose now that a tensor j ' with four or more indices is 
given that satisfies ( 88 ) and (91), along with a t"' with the same number of indices 
satisfying (89). Construct m ' from them so that (87) holds. Then (85) will hold 
due to (90) and (91) while (87) holds by construction. These are the two equations 
that define j " and t"' in terms of m " so the j " and t"' originally chosen will be 
recovered. This proves their independence, as required. 

It follows that j ' meets all the conditions for the counterpart tensor to T", but its 
expression (87) is rather complicated once a and b have been evaluated as functions 
of n. An alternative and more useful counterpart is given by 

jC-s5^(3a _ ^ ^C "£[^[/^7]«] for n > 0. (93) 

The second equality follows from (87) and gives an expression for the /’s in terms of 
the m’s in which the number of indices does not enter explicitly. The nested square 
brackets denote antisymmetrisation independently over pairs of indices, the opening 
and closing brackets being paired in order from left to right. In (93) this means that 
the antisymmetrisation is applied to the index pairs and Pa. 

It follows from this definition that the /’s have the symmetry properties 

^ j(C-e)[5^Wa\ jC-eS^pa] ^ q for n > 0 , ( 94 ) 

jC-[eSj]l3a ^ Q forn>l, ( 95 ) 



which for ^ = 0 and ^ = 1 are the symmetries of the curvature tensor Rs^pa and its 
first derivative ^sRs^yPa^ in the latter case when the Bianchi identities are taken into 
account. 

Some straightforward but tedious algebra shows that the 7 ’s can be recovered 
from the /’s by 

jOi-h^e _ j{a-p\5\-i)e for ^ > 1 

n + 2 



( 96 ) 
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where the vertical bars ‘ | ’ enclose indices that are to be excluded from the symmetri- 
sation. If / " satisfies (94) and (95) then the j ' constructed from it by (96) satisfies 
(88), (91) and (93). This completes the proof that the y’s and /’s are equivalent as 
counterparts to the r’s in the decomposition of the m’s. 



11 The Orthogonality Conditions Revisited 



As with Mathisson’s original moment-defining equation (51), its modified form (80) 
needs orthogonality conditions imposed for it to determine the moments uniquely. 
The conditions (61) used in Sect. 8 have the same symmetries as the moments them- 
selves and so can be split into the two sets 

= 0 for^ > 1 (97) 

and 

„^m‘^-e[S[l3'y]a] ^ q forn > 1. (98) 

Note that n has a different meaning in each equation as it is the number of indices 
in the set marked by dots. Both conditions are n > 1 but the moments with three or 
four indices only occur in (97), so it is only for those with five or more indices that 
the orthogonality condition separates into two parts. 

The second set can immediately be expressed in terms of the /’s of (93) as 

n^jC-eilHa foj.„>l. (99) 

The first set, however, is not a restriction purely on the ^’s. To complete the decom- 
position of the moments into two independent sets, it will therefore be modified for 
n >2hy extending the index symmetrisation to include the 5. This then gives 

n^lSr-/3a^Q for„>2. (100) 

This change does not affect the number of constraints placed on the components 
of the m’s concerned as the new and old conditions have the same symmetries. It 
cannot be applied for ^ = 1, however, since is symmetric but is not, 

so in that case it would increase the number of constraints. For the time being 
will be left unconstrained, which leaves a little indeterminacy in the definition of the 
moments that will be resolved later. 

The quadrupole and higher moments have both a /-part that satisfies (94), (95) 
and (99) and a r-part that satisfies (89) and (100). At first sight it seems difficult 
to count the number of linearly independent components in these parts due to the 
complexity of these conditions. In fact it is easy, though the count is best done on the 
equivalent j forms rather than directly on the / forms. In m dimensions the number 
of linearly independent components of a totally symmetric ^ -index tensor is given 
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by the binomial coefficient For the spacetime manifold m = 4. With this 

in mind, consider the tensors for a 2 ^ -pole moment for n > 2, for which both parts 
have (n -h 2) indices. An (n -h 2)-index tensor satisfying ( 88 ) or (89) will have N\ (n) 
or N 2 (n) components respectively, where 

*<»)=(";") x0). (101) 

The further condition (91) on the 7 -part gives N 2 (n) constraints, leaving it at this 
point with 



N^in) = A^i(^) — N 2 (n) = (n 4~ 2)(n 4~ 2)(n — 1) (102) 

components. This is therefore the number of linearly independent components of 
a J tensor satisfying (94) and (95). The orthogonality condition (99) has the same 
symmetries as the /’s but one fewer index, so it gives N^in — l) constraints, leaving 
the (^-h 2 )-index / tensorwith A 3 (/i)— A 3(/2 — 1 ) = (^-h2)(3^ — 1 ) such components. 
The corresponding count for the (n -h 2) -index t tensor is N 2 (n) — N 2 (n — 1) = 
2(n -\-2)(n + 3). 

It will be seen in the next section that for ^ >2 the t tensors actually vanish, 
leaving the /’s as the complete description of the quadrupole and higher moments. 
The quadrupole moment therefore has 20 such components, in agreement with 
Mathisson’s full description of his quadrupole before he neglected components with 
a non-classical analogue. This again shows the advantage of Mathisson’s approach 
over that of later authors; in their treatment of the quadrupole case neither Taub [23] 
nor Madore [24] had full descriptions of the quadrupole with 20 linearly independent 
components. 



12 Solution of the Variational Equation 



From (83) the contribution to M^^[V^(<^A/ 3 )] from the first term of the series (84) 
vanishes by (85) and the next two terms give 

j 0) + 0)) ds. (103) 

The Fourier transform has here been evaluated to give terms involving derivatives of 
Aa(z, X) Sit X = 0. Now it was seen in Sect. 9 that the components of X^ can be 
considered as a normal coordinate system on M with pole z. The terms in (103) 
are partial derivatives of Ac^ at z in this coordinate system. They can be replaced by 
covariant derivatives as each extra term vanishes, some because F^^ = 0 at the pole 
of a normal coordinate system, the others because Aq. = 0 at this point by (71). If 
t^a expressed in terms of the m’s by (85) then this contribution takes 

the form 
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( 104 ) 



Now (71) gives 



V/^Aq, = ip^a at Z 



(105) 



and since Aq. satisfies (66) along all geodesics through z it implies that 



^(7/3) Aq; — V{7^q;/ 3} at Z 



(106) 



where curly brackets around three indices are defined by (67). When this is put 
back into (104), two of the three terms in this curly bracket expansion cancel by the 
symmetry of on a and p. For (pcip of the form (65), the result is 



Suppose now that uja is chosen so that it, VpuJa and all vanish on the 

world line L. Then expression (107) vanishes, as do the parameters and on 
L given by (68), and so also the contribution (86) to the variational equation (81). In 
addition the vector field vanishes for all z e T by its definition through (69), so 
that Ack = oj a ^ For such uja the variational equation therefore reduces to 



The t's in the summation are those on which the orthogonality condition (100) 
was imposed. By (85) they are also symmetric on all the indices contracted with k’s. 
The method of Sect. 8 may therefore be applied to show that the k-space integral in 
(108) depends on uj^ only through its value on where as in that section, ^(s) 
is the hypersurface formed by all geodesics through z(z) orthogonal to ria- If uJa is 
multiplied by any function that is constant on each E (^'), it still satisfies the vanishing 
conditions on L so (108) remains valid. Each k-space integral becomes multiplied by 
the value of the function on the corresponding E (^') as it only depends on the value 
of uJa on that hypersurface. Since (108) has to hold for all such multiplier functions, 
it follows that the ^-space integral must itself be zero for all 5'. 

Let r (v) be the scalar function such that r(x) = s if x e E (^'), again as in Sect. 8. 
Let paM bo any smooth vector- valued function of compact support. Choose any 
fixed coordinate system on M and set 




(107) 




= 0 (108) 



with Qa = Exp^cjo;. 



0 Ja(x) = ai 3 ^s(T(x))(x'^ - Z^(r(x))(x^ - 7(r(x))(x'^ - z'^(t(x))?/)q,(x) (109) 
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where Uf^ry^ is a smooth function of 5'. This determines the vector field uoa by giving 
its components in the chosen coordinate system. This uja and its first two derivatives 
vanish on L so the above results apply to it. 

Consider the ^-space integral in (108) with this uoa for some fixed value sq of 5’. 
It depends on uj^ only through its value on T^(so), and for x G H(so), r(x) = sq 
by construction. The value of this ^-space integral is therefore unchanged if r(x) 
is replaced by sq throughout the definition (109) of uJa- Now choose the coordinate 
system used in (109) to be the normal coordinates given by the components of 
X G Tz(M) for z = z(5'o) as described above. Then z°^(5'o) = 0 as this point is the pole 
of the normal coordinates so that (109), with the replacement just described, gives 

^a(z(so), X) = ap^s(so)X^X^X^'i>Miso), X) (110) 

where The Fourier transform of (1 10) gives 

k) = 

If this is put back into the ^-space integral of (108), the vanishing of that integral 
gives 

^ j = 0 ( 112 ) 

where 

= X ^7 • • • (113) 

«>o ”■ 



Since is an arbitrary smooth function of compact support on M, so also is on 
r^(M). Hence (112) completely d^nes as a generalised function (distribution) 
on T^{M) with Fourier transform N^. As is identically zero, so also is and 
hence also all coefficients in its power series expansion (113). Note that the Fourier 
transform of such a generalised function is analytic, see for example Gel’fand and 
Shilov [30] so the power series expansion is valid. Since a(^e3 is also arbitrary, the 
vanishing of the power series coefficients gives 

= forn>3. (114) 

For a general uJa the only surviving part of is (107). The other 

contribution to the variational equation (81) is given by (86), so with (68) 

that equation now takes the form 

j + (m^“ + \L(^^)VpiOa{z) + = 0 (115) 
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for all smooth vector fields on M of compact support. Interestingly, this is 
Mathis son’s original version in the case when only monopole and dipole moments 
are retained, but with the addition of external force and couple terms and 

It is solved by reducing it too to a form in which the 5' -integrand depends on uoa 
only through its value on T^(s). To do this, at each point of L define the projection 
operators 

pj^^xnav^ and ^ Af - />/ (116) 

where x = Aa is the unit tensor. If x“(m) is an affinely parameterised 

geodesic through z(s) orthogonal to at z, as used in Sect. 9, uJa on E (s) determines 

and ^uja) where x^ = dx^ /du. But x^V^x^ = 0 since x{u) 

is geodesic, so it also determines x'^x^V^puJa- Now at z the vector x^ can be any 
vector orthogonal to ria- It follows that 00 ^ on T>(s) determines the values of 

Wa, Qp'^-ftOa and Q'^QpV(e5)^a- (U7) 

The derivatives in (1 15) can be decomposed by projection into the parts given in 
(117) together with similar parts in which one or more of the Q operators is replaced 
by a P. Wherever a P operator occurs, it can be eliminated by using the identity 

= (118) 

followed by an integration by parts. The integrated part will vanish as uoa has compact 
support. If the P operator is not acting on the outermost derivative, the Ricci identity 
should first be used to commute the order of differentiation. In this way the integrand 
can be brought to a form in which uo^ occurs only through the projections (117). 

If uja is multiplied by a function constant on each T^(s), the projections in (117) 
will each be multiplied by this same factor. It then follows by the argument used 
above that the 5’ -integrand must itself be zero. But for a fixed value of 5’ the values of 
cjcf, and at z can be specified independently, subject only to the last 

of these being symmetric in 7 and f3. For the 5' -integrand to vanish, the coefficients 
of each of these derivatives must be zero. These results complete the solution of the 
variational equation. 

This process is best carried out in stages. The last projection in (1 17) can only arise 
from the last term of (115). The vanishing of the coefficient of therefore 

gives 

= o (119) 

which with (116) shows that must have the form 



( 120 ) 



32 



W.G. Dixon 



for some tensor that at this stage is not necessarily antisymmetric. But 

_ ^(7/3)0 ^(7o)/3 _ ^(/3o)7 (121) 

SO that (120) gives 

^7/3a = 1 (122) 

from which 

(123) 

If this is put back into (115), an integration by parts brings it to the form 

+ F“) LOa + = 0. 

(124) 

Now the second projection in (1 17) can only arise from the last term of (124), which 
leads to 

+ (125) 

In a similar manner to (120) it follows that 

^/3a_lS^_^W/3a^^/3 a (j 26 ) 

for some vector from which 

-\^-^ + \L^^VpWo, = p^^Wo,. (127) 

Finally this can be put back into (124), when an integration by parts brings it to the 
form 

j ~^ + F^^oOads = 0 (128) 




1 1^/3575 - ^ + F“ = 0 . 



d^ 



from which 



(129) 
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13 Equations of Motion 



The results of the previous section provide a complete solution of the variational 
equation but they are not yet in the simplest form. There is still a little freedom in 
the moments that can be used to make one further simplification. In the same way 
that the first few terms of were evaluated to give (104), so also the 

first two terms of as given by (55) can be evaluated to give 

J d^. (130) 

With and given in terms of and by (122) and (126), the contribution 
to (130) from the symmetric part of is 



1 

2 



d^ 



d5. 



(131) 



The two terms of the integrand combine to give a total derivative that integrates to 
zero since has compact support. It follows that is not determined by the 
moment-defining equation (80). 

This was foreseen in Sect. 1 1 where no new orthogonality condition was imposed 
to replace the case ^ = 1 of (97), namely = 0. That condition set a symmetric 

two-index tensor to zero, so it is not surprising that omission of a replacement has left 
another symmetric two-index tensor undetermined. The definition of the moments 
is completed by imposing 

5^^"^ = 0. (132) 



Since and are now both antisymmetric, the symmetric part of (126) gives 



while (122) simplifies to 





(133) 


^a/?7 = 


(134) 



The result (129) can be written in the form 



d^- 






a 

■( 3^6 



-h 



(135) 



while the antisymmetric part of (126) gives 
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Equations (133) and (134) give the forms of the monopole and dipole moment 
tensors in terms of the momentum vector and antisymmetric spin tensor 
while (135) and (136) are the equations of motion that the momentum and spin must 
satisfy. 

These are the only equations that govern the evolution of the moments. The quadru- 
pole and higher moments are completely specified by the tensors J since their 
^-parts vanish by (114). The J tensor for the 2'^-pole moment, n > 2, has (n + 2) 
indices. It satisfies the symmetry conditions (94) and (95) and orthogonality con- 
ditions (99) which leave it with (n + 2) (3^ — 1) linearly independent components, 
as seen in Sect. 1 1 . These tensors determine the gravitational force and torque 

L»l} 

as will be seen in Sect. 15 but their evolution with time is not constrained by the 
energy-momentum conservation equation (3). This is the same situation as in New- 
tonian mechanics. Such higher moments are of course still governed by physical 
laws, but these are solely the constitutive relations for the material of the body. 

This resolves the second issue seen in Sect. 4 to arise from Mathisson’s paper of 
1937. The key to this resolution is the complete separation between the t and J tensors 
achieved by the modification of the orthogonality conditions made in Sect. 1 1 . Had 
this not been done, there would have been higher order J tensors occurring in the 
variational equation. These would have led to the infinite set of evolution equations 
foreseen in Sect. 4, even if it had still been possible to solve that equation exactly, 
which may well not have been the case. The change of orthogonality conditions made 
a second modification to the moment definitions beyond that made in Sect. 9 even 
though it did not change the defining equation (80). This change achieved invisibly 
the elimination described in Sect. 4, but despite it being invisible the issue was real 
and the elimination essential. 

Recall now that the world line L is still arbitrary and that, as mentioned at the start 
of Sect. 7, the parameter 5' in the development so far need not be proper time. For the 
rest of this section only, take it as such, so that v^Va = 1. If (136) is multiplied by 

it then shows that 

o caf3 

p°^ = Mv°‘ + vg- L°‘^vg ( 137 ) 

d^ 

where M = p^Va^ Putting this back into (136) gives 



8 5 "^ 
d^ 



-h 



d^ 



— 



d^ 






(138) 



It can now be seen how these results are related to those of Mathis son and Papa- 
petrou. Their results are obtained by eliminating p^. When quadrupole and higher 
moments are neglected, and are both zero. The form (138) then agrees 
with (15) and if (137) is put back into (135) it then gives (17). A comparison with 
Mathisson’s quadrupole approximation will be made in Sect. 24 after the force and 
torque terms and the higher moments have been studied in more detail. 

There is also a direct correspondence with the results of Pryce and Mpller in fiat 
spacetime that were studied in Sect. 5. In contrast to the results of Mathisson and 
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Papapetrou in general relativity, those results treat an extended body exactly, as do 
ours. They therefore provide an important check. If they cannot be reproduced by 
work that claims to be exact in curved spacetime, something is wrong. Fortunately 
there is agreement. Again, being flat spacetime, and are both zero. The 
original form (136) of the equation for then agrees with (28) and correspond- 
ingly (137) and (138) agree with (29) and (31). These results add support to the 
identification of and as the momentum and spin of the body. 



14 The World Function 



The exponential map played a central role in Sect. 7 in enabling the use of Fourier 
transforms in a curved spacetime. So far it has not been necessary to write explicit 
forms for the maps it induces between fields on the spacetime manifold M and those 
on its tangent spaces, but this cannot be put off indefinitely. There are two important 
tasks still remaining for which this is essential, namely evaluation of the force and 
torque and and derivation of explicit expressions for the moments in terms 
of the energy-momentum tensor. 

The tool needed for this is the world function, first used extensively in general 
relativity by Synge [31], although the notation used here follows that of DeWitt and 
Brehme [32]. As it is not in common use, however, this section will define the world 
function and derive the properties that will be needed. 

The world function is the two-point scalar function defined by 

1 dx^ dx^ 

a(x(ui),x(u 2 )) = 2 (^ 2 -ui) / ga^(x(u))- — du (139) 

Jui dw du 

for two points on a geodesic x(u) with affine parameter u. This is well defined for 
any two points that can be joined by a unique geodesic. It will be assumed that this 
holds in the situations in which it will be used. 

Let 6x(u) be a variation of the path of integration in (139) and Sa be the corre- 
sponding change in the value of the integral. Then by an integration by parts, as in 
the standard derivation of the Euler-Lagrange equation of the calculus of variations. 



Sa = (u 2 — u\) 




dx"^ du 



(140) 



where = dx^/du. But 5 jdu = 0 since x(u) is an affinely parameterised 
geodesic, so the integral term vanishes. 

For convenience of notation, from now on the two endpoints will be labelled as z, 
X corresponding iou = u\, U 2 respectively. Indices ct, /3, 7 , . . . will refer to the point 
V and K, X, p, . . . to z. This index convention will be used throughout the remainder 
of this paper and when results obtained so far are quoted, they will be amended to 
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comply with it where necessary. Covariant derivatives of a will be denoted simply 
by appropriate suffixes, e.g., where acts at x and V,. at z. With 

this notation (140) can be written as 



5a = aa5x°‘ + a^5z'^ = {u 2 - mi) (gapU^Sx°‘ - g^\U^5z'^'^ (141) 

from which 

= (u2 — u\)U^, = (mi — U2)U'^. (142) 

The two equations are equivalent, the term in the bracket that has the positive sign 
being the m - value corresponding to the point at which the derivative of a is evaluated. 

The integrand in (139) is in fact constant along an affinely parameterised geo- 
desic, so 



(7 = \{U2 - = i(M2 - uifg^xU^U^ 


(143) 


which with (142) gives 

2a = aacr^ = o'Ka'^. 


(144) 


Differentiating (144) then gives the important relations 




cr =a.^a =a.f^a>\ a =a.^a =a.^a 


(145) 


from which 


(146) 


This shows that 

(M2 - Ml) = 0 


(147) 


along geodesics through z, a result that will be useful later. 

It is often useful to consider these two-point tensors in the limit x ^ z, when they 
become ordinary tensors at z. This is known as their coincidence limit and denoted 
by (•••). If (142) is used in (145) and this limit taken, it can be seen that 


II 

II 

1 

II 


(148) 


where the deltas are the Kronecker delta symbol. 

Now the definition of the exponential map is that x = Exp^X for X g T^{M) if 


X” = (M2 - Ml)?/”. 


(149) 



By (142) this can be written as 






cr‘^{z,x). 



(150) 
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For fixed z the components X'^ are functions of x and can be considered as the 
coordinates of x in a new coordinate system. By its construction, this is a normal 
coordinate system with z as its pole. 

The map Exp^ defined in Sect. 7 takes a two-point function (/:?"; (z, x) on M that is 
a tensor at x and scalar at z and maps it to a function O"; (z, X) on (M) whose value 
is a tensor at z. In component form the components of O"; are nothing other than the 
components of (p[" in this normal coordinate system. The coordinate transformation 
is given by (150) from which 

dx>^ Qx^ -1 

dx^ Qx^ ^ ^ 

The second form has the matrix inverse made explicit as otherwise denotes 
the double derivative with the index a raised with which has a different 

value. The tensor transformation rule therefore shows the map Exp"^ to have the 
explicit form 



-1 

• • • (- ct^a) • • • (152) 

The object O"; can also be considered as a two-point function on M with x = 
Exp^X as its second argument. With this interpretation it is a scalar at x and a tensor 
at z. The individual terms in (152) then all appear as tensors rather than as elements 
of a coordinate transformation. Different operators for differentiation apply in the 
two interpretations. When considered as a two-point function on M, the standard 
covariant derivatives Vq, and V,. apply. Considered as a function on T^(M), there 
should also be two covariant operations of differentiation. One is defined by 
(75). What to take for the other will now be investigated. 

Since O"; is a scalar function of the argument x, or X as appropriate, both 
and reduce to partial differentiation. The relationship between them is just the 
usual chain rule for a change of variables, which with (151) gives 

-1 

where ~ • (153) 

When O;;; is treated as a function on T^(M), the operator describes variation 

of X G T^(M). The point z can also be varied, but the tangent space on which O"; 
is defined then also varies. It therefore needs to be considered not as a function on 
a single tangent space but on the tangent bundle T M, which for an ^-dimensional 
manifold M is a 2^ -dimensional manifold formed from the union of all its tangent 
spaces. It is in this context that there is a counterpart to , a derivative operator 

that describes variation of z and is such that X^^X^ = 0. To motivate its construction, 
consider as a function of z and x. Then (150) gives 

= -Cj\, 



(154) 
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from which 



-1 









(155) 



So if ^ K,^ is d-cfincd- by 



= ¥«$::: + where = - <7“;^ a\ (i56) 

then Vkh=X^ = 0 as required. The minus signs in the definitions of and 
ensure that their coincidence limits are the unit tensor, as follows from (148). It 
is shown in [25] that and together form the covariant derivative on T M 
determined by the affine connection on TM induced by that on M. 

There is a component form for analogous to (75) for To derive it, write 
the covariant derivatives in (156) in their component forms to give 






so::: 






so::: 



+ r -terms 



(157) 



where the variables being held constant in the partial derivatives are marked explicitly 
and the connection terms arising from the tensor indices on O::: have been suppressed. 
In a component form for partial derivatives need to be in terms of and 
rather than z^ and To make this change, note that 





d^Z. 


d^Z. 


dX^ 


5z^ 


X ~ 




z 



and K^, 



SO- 



dx^ 



A so::: 

dX>^ 



(158) 



where the second result makes use of (151) and (156). But 



- (159) 

When these results are put back into (157), two partial derivative terms cancel to 
leave the final form 



dX^ 



K = = - — 

Sz" 






SO^ 

-fl 

dz'^ 



~rp X' 



SO^ 

dxp i^iJi -v’ 



(160) 



One contravariant and one co variant index has been retained here to show the T- 
terms suppressed above, which are simply the usual ones that arise from acting 
on the tensor indices at z. 

The operator has an important property related to integrals over the tangent 
spaces. Consider the absolute derivative along L of a tensor function 4^.-.". (z) formed 
by integrating 0:::(z, 2f) over T^{M). To evaluate this, the various tangent spaces 
along L need to be mapped into one another so that the integrands can be compared. 
Let parallel transport be used to create this mapping. Then in component form 
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/ 



I <!>::: (z,x)Dx 








d^Z 




('’ dz^ 


^ dX^ 


ds 1 



DX 



(161) 



where the dependence of X'^ on 5' is parallel transport, so that dX^/ds + = 

0. On substituting from this into (161), the result is seen to hold for all v'^ when 



d 

dz'^ 



/ 










dX^ ) 



DX. 



(162) 



The covariant derivative has connection terms arising from the tensor indices. If these 
are added in then comparison with (160) shows that 



7 ^J <d:::(z,z)dx = J v«*(&:::(z,x)dx. 



(163) 



A similar method but with parallel transport also of the argument vector can be 
applied to the definition of the Fourier transform to show that commutes with 
the operation of Fourier transformation. 

Suppose now that x(u,v) describes a family of geodesics with v constant and 
u an affine parameter on each geodesic. Let = dx^ldv. Then satisfies the 
equation of geodesic deviation along each geodesic. This is indeed the definition of 
that equation. Consider (142) for these geodesics with z denoting z(v) = x(ui, v) 
but V now taken as the generic point x(u, v) with u variable. It gives 



cr^(z(i;), x(u, v)) = (u\ — u)U'^(v). (164) 



Note that the tangent vector at z(i;) is independent of u. Both sides of (164) are 
vector fields along the line given by u = u\, v variable. Its absolute derivative h /dv 
along this line gives 



= (ui-u) 



8t/« 

dv 



(ui — u) 



5 

du 



(165) 



The vector field is defined only on the surface formed by this family of geodesics, 
but if it is extended arbitrarily off this surface then SV^/dw = The 

extension is required only to make the covariant derivative meaningful. With this 
and (164), (165) can be put in the form 



t/A I \tOL ^Av7 

from which 



where 






— CF . 






— CF . 



(166) 

(167) 



(168) 
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as in (153) and (156). This gives a formal solution of the geodesic deviation equation 
with given initial conditions. As a Killing vector field satisfies this equation along all 
geodesics, it also determines the entire field from the value of it and its first derivative 
at one point. 



15 The Gravitational Force and Torque 

The solution (167) of the equation of geodesic deviation provides an explicit expres- 
sion for as defined by (69). It gives 

(169) 

Now in the notation of (168) and as a special case of (152), the G,^\ defined by (72) 
is given by 

G«A = (170) 

Note that indices k, X, .. . now need to be used on fields that are images under Exp^ as 
their values are tensors at z rather than at v . This is in accordance with the convention 
introduced in the previous section. Then as defined in (72) can be put in the form 

S« = G«aT^ where = a\A^^ + (171) 

with (79) becoming 

3«a = V,(«(Ga)^T^) - G«a^T^ (172) 



This gives 

S^A = (g,(«V*a)<t^^ + 1(t('^V*,G«a) + (X,G«Am “ V*(«(Ga)^X,)) 

(173) 



when the explicit form (171) is used. But 

(174) 

by (153) and because = 0, which holds as the product is minus the 

unit tensor. On using this product again with (170), this result gives 

Gk«V*a)ct('^ = (175) 

Moreover, (153) and (168) give 

<t%V*,G^X = <ri V«G«a = K'^^^aG^x. 



(176) 
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From (175) and (176) with (156), (173) can be put into its final form 

S«A = - V,(«(Ga)^X,» . (177) 

This expression for E,^x enables explicit expressions for the force and torque 
and L,^x to be extracted from (86). With use of (55) and some straightforward 
manipulation of the Fourier transforms, it gives 

Ff, = J F[V^*G«a] Dfc, (178) 

— ^2 J (i GfcA[/jWi/]Wi tn fcf{V*[^Giy]A^ Dfc. (179) 

Now = i by (84) and by (114) all the terms in the expansion of vanish 

except those involving and The contributions from these two terms to the 
right hand side of (179) separately evaluate to zero. To see this, note that 
by (85) and so is symmetric, and that Gxk(z, X) = gx^iz) and V*^Gak(z, X) = Oat 
X = 0 since G/^a gives the components of the metric tensor in a normal coordinate 
system with pole z. 

It was shown in Sect. 14 that commutes with the Fourier transform operator 
F, so in (178) it can be brought outside the F term. With these simplifications (178) 
and (179) now reduce to 



Fi, = j V^,G,a D^, (180) 

J (G«A[MV*^]m''^) Dk. (181) 

These are the final exact expressions for the force and torque exerted on the body 
by the gravitational field. They do not depend on any assumptions of analyticity, as 
although is defined by (56) as a power series, it is convergent for any body of 
finite spatial extent. Taken together with (135) and (136) they provide exact equations 
for the evolution of the momentum and spin along the chosen world line L . To provide 
complete and exact equations of motion, all that is required is to add a supplementary 
condition that specifies L uniquely as the world line of a suitably defined centre of 
mass. This will be addressed in Sect. 21. 



16 Multipole Expansions 

When the series (56) for is substituted into (180) and (181), it shows that higher 
multipole moments of the body interact with successively higher derivatives of the 
gravitational field. It was noted in Sect. 4 that this was to be expected, but (135) and 
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(136) show that these interaction terms are the only occurrences of the quadrupole 
and higher moments in the equations of motion. It is only this latter fact that makes 
it valid to neglect higher moments when the size of the body is small in comparison 
with the length scale of variations in the gravitational field. This is the multipole 
approximation, valid as what is being neglected is not the higher moments themselves 
but is instead their diminishing contributions to the force and torque. 

To make the multipole approximation the series (56) for is truncated on 
substitution into (180) and (181). The k-space integration can then be performed 
term by term to give 



1 y (-i)" f ^ . 

2(2:t)4;^ .! J ^ 


• ' k(jX Xjj, Dk, 


(182) 


2 y (-1)" j 

»! J 


k(j • • • kq-Gjj,i>p T)k 


(183) 



where the series extend to the desired multipole order. 

Note that in these integrands is the position vector in the tangent space, the role 
that was played by in (156). So as was seen following that equation, = 0 

and since also = 0 then X/^^kx = 0. The in (182) can therefore 

act on the whole integrand and so by (163) can be brought outside the integral as an 
ordinary covariant derivative. With (76) the k-space integrals in (182) and (183) can 
then be performed to give 

= (184) 

n>0 

= Z (185) 

n>0 

The G -terms are all evaluated at (z, X) with X = 0 and so, with the interpretation of 
the components of as normal coordinates on M with pole z, they are the tensors 
whose values at the pole of a normal coordinate system reduce to repeated partial 
derivatives of the metric tensor gx^. 

Given any tensor field, there is a unique tensor at a point z whose components 
reduce to the ^th partial derivative of that field at z in any normal coordinate system 
with pole z. This is known as the ^th extension of that tensor field, a concept intro- 
duced by Veblen and Thomas [33], for which see also Thomas [34] and Schouten 
[35]. Following Veblen and Thomas, the additional n indices of the ^th extension 
will be suffixed to the tensor and separated by a comma from the original indices. 
Note that such extensions are symmetric in their additional indices, unlike repeated 
covariant derivatives. So 
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= g\^^^...p{z) whenX = 0 (186) 

which enables the multipole expressions for the force and torque to be put in the 
forms 

= rnP-^^^V^gxp,p...a, (187) 

n>2 

= Z (188) 

n>\ 



where, as before, the curly bracket notation for indices is defined by (67). 

In principle the summations extend from ^ = 0 but since = 0 

then the first two terms of (187) and first one of (188) vanish. The lower limits to the 
summations have therefore been adjusted to take account of this. 

The moments occurring in these expressions are the quadrupole and higher 
moments that are described primarily by the tensors J . Since the tensors t ' vanish 
for these moments, (87) and (96) show that the m’s are given in terms of the /’s by 

4/7 

n -\- 2. 

The general expressions for force and torque are simpler if left in terms of the m’s 
but this enables the expansion to any particular order to be expressed in terms of the 
/’s when required. 

The Veblen extensions of the metric tensor occurring in these expressions can be 
expressed in terms of the curvature tensor and its covariant derivatives. This is not 
entirely straightforward, however, so the tools for doing so will be developed in the 
next section. 



17 Evaluation of the Veblen Extensions 

The relationship between Veblen extensions, covariant derivatives and the curvature 
tensor is most easily developed via another concept, normal tensors, also due to 
Veblen and Thomas [33]. The nth normal tensor is the tensor that reduces 

at the pole of a normal coordinate system to the nth partial derivative of the 

affine connection. 

A normal coordinate system with pole z is one in which the coordinates of z are 
all zero and lines of the form x^(u) = uX^,X^ constant, are affinely parameterised 
geodesics through z. These lines therefore satisfy the geodesic equation 

^ dx^ dv^ 

du^ du du 



( 190 ) 
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Since also — 0 for ^ > 2, differentiation of (190) n times shows that 

• • • X^X^X'^ ds...sT^^ = 0. At z this must hold for all X^, hence = 0 

at z. Together with the definition of the normal tensors this shows that 






( 3 ^ 6 - 



■e - ■ (Pj){6-s) 



and 



W-.) = 0. 



The curvature tensor is given in terms of the affine connection by 



(191) 



= + (192) 

This, and successive covariant derivatives of it, evaluated at the pole of a normal 
coordinate system show that 

= 2^W-C + (193) 

where O(N^) denotes terms involving two or more normal tensors of lower order. 
Taken together with the first equation of (191), this enables any pair of indices of a 
normal tensor to be exchanged. If the symmetrisation in the second equation of (191) 
is expanded and the indices in each term rearranged in this way into a common order, 
it will give the ^th normal tensor in terms of curvature terms and normal tensors of 
lower order. All normal tensors can then be evaluated successively in terms of the 
curvature tensor and its derivatives. With all lower indices, the first two are found 
to be 



P^a/3^6 — ^a(3^5e — {5^e){[3^)a 5 ^(/3^7)(^£)cf (194) 



The formula for the Christoffel connection gives 

9^9a(3 — (195) 

Evaluation of this and its partial derivatives at the pole of a normal coordinate system 
then gives the Veblen extensions of the metric tensor in terms of normal tensors, the 
first two being 



9a(3,^S — — 3^cf(7(5)/3» 9a(3,^5e — '^P^{al3){'ySe) — ^ {'j ^d\al3\e) • 

(196) 

Note that in the normal tensor expressions, the symmetrisation over the extension 
indices is not necessary and does not arise from (195). Even without this explicit 
symmetrisation the results must be symmetric in these indices as it has already been 
shown that the Veblen extensions have this property. Adding the explicit symmetrisa- 
tion therefore does not affect the result and has the advantage of simplifying further 
evaluation with (194). Veblen extensions of other tensor fields can be similarly related 
to repeated covariant derivatives via normal tensors, but will not be needed here. 
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18 The Moment Definition Recast 



It was seen in Sect. 4 that Mathisson’s assumptions of analyticity made it impossible 
to deduce explicit expressions for the moments from his implicit definition of them 
through (51). Sections then showed that by modifying the defining equation into 
the form (54) the analyticity could be avoided and that this would enable explicit 
and unique expressions to be obtained for the moments as integrals of the energy- 
momentum tensor. But (54) was not the final definition. Motivated by a study of the 
variational equation, (54) was replaced by (80) which became the final form for the 
moment-defining equation. 

Mathisson’s orthogonality conditions were also modified as described in Sect. 11. 
All these changes mean that the methods of Sect. 8 can no longer be applied directly. 
The defining equation will now be recast into an equivalent form that is amenable to 
those methods. 

Consider (80) now with a ipa(3 that does not necessarily have the form (65). Let 
Xa{s,x) be constructed from as in Sect. 9, as the function that satisfies (66) 
along all geodesics through z(s) e L with initial conditions (71). Define 



e„/ 3 (s,x) = - V(aA^)(s,x), = Exp^e„/3- (197) 

Then with defined by (59) and A/^ by (72), (197) and (74) together give 

(198) 



The defining equation (80) can therefore be put in the form 

J Dx = + V*(«Aa)]. (199) 

The advantage of this version is that, as will be proved below, necessarily 

has the form 



E^x(s, X) = X^X^E^^xAs, X) (200) 

for some with the symmetry properties 

— ^[/€/x][Az/] 9 ^KlllXiy] — 0- (201) 

It follows from these that also so any E,^x constructed from such 

an Ef^fj^xu by (200) is symmetric. 

The Fourier transform of (200) gives 

E^xis, k) = {-xfg>^Pg''^^^p^^aE^^xAs, k) 



(202) 
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SO that from (55) and (93) 

k)E^i,x^{z{s). k) (203) 

where 

k) = y k^--- kpr-P''^^^^(s) (204) 

r. n\ 

n>0 

and as elsewhere, n is the number of indices in the set a • • • p. The fields k) 

and k) both have the same symmetry properties (201) as X), the 

former as a result of (94). 

Due to the orthogonality condition (99) the method of Sect. 8 shows that the 
5’ -integrand of (203) depends on Ejj^xi^ only through its value on the hyperplane 
^^2f^ = 0ofr,(,)(M).If 



^S'jPa — SO that €po; — (7 cr'^ ^ (35 -> (205) 

this dependence translates as that on es^^a(s, x) only through its value on the usual 
'E(s), as in (63). 

The contribution M^^[V^(/^Aa)] to the right hand side of (199) is given by (83). 
This has already been studied in Sect. 12, where it was found that only two terms 
survive in the series (84) for due to (114). These terms still reduce to (104), and 
(105) and (106) still hold, giving 

M«4v*(«Aa)] = J [p^v\s)^^x(z{s)) + S^\PVn^x^^{z{s))) ds (206) 

in place of (107) since no longer has the form (65) but and have the 
forms (133) and (134). 

The right hand side of (199) is therefore an 5' -integral whose integrand depends 
only on the values of ip pa and V^cppa at the point z(s) and of es'jpai^, on Ti(s). 
This restricted dependence allows the methods of Sect. 8 to be used to evaluate all the 
moments as integrals of the energy-momentum tensor. This will be done explicitly 
for and in the next section and for the higher moments in Sect. 24. 

It remains to prove the existence of an satisfying (200) and (201). The 

proof uses the results of Sect. 14 and is by construction, but for a given E,^x^ Ei^xiiu 
is not unique so no significance will be attached to the particular constructed value. 

For a fixed 5', let v(m) be an affinely parameterised geodesic with z{s) = x(0). 
Indices a, .. . will refer to the generic point v(m), so (142) gives = u~^ 
where = dx^/du as in Sect. 14. Note first that 



Cap = 0 at V = z{s) 



(207) 
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by (71) and that (66) applied to can be put in the form 



= 0 (208) 

where curly brackets around three indices are defined by (67). Multiplication of 
(208) by shows that U^U^eap is constant along these geodesics and the initial 
condition (207) then shows its value is zero. By (142) this gives 

= 0. (209) 



Differentiation of (209) and use of (208) shows that 

a°‘al^Vaei3j + = 0 . ( 210 ) 

If this is multiplied by use of 

( 211 ) 

enables the result to be put in the form 

|-( 7 />%^) = 0 . ( 212 ) 

The formula (211) follows from (147) on differentiation of as a matrix inverse. 
It follows that H^^a^epj, which is a vector at z and scalar at x, is constant as x varies 
along any geodesic through z. As it vanishes at z, it therefore vanishes everywhere, 
so that 

^ for all X (213) 

from which 

X^^E^x(s,X) = 0. (214) 



Differentiation of (214) gives 

E^x(s, X) = . (215) 

But also 

= (216) 
where w is a scalar parameter, which together with (215) gives 



(s,uX) • 



^ (uE^xis, uX)) = 2mZ^ [V*[^£«]a] 



(217) 
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Iterating this result for the second index of E,^\ shows that 
d^ 

^ {uE^xis, uX)) = AuX'^X^' . (218) 

Since both uE^\{s, uX) = 0 and d{uE^\{s, uX))/&u = 0 at w = 0, (218) can be 
integrated to give (200) with 

which satisfies the symmetry properties (201) identically. The existence proof is now 
complete. 



19 Momentum and Spin as Integrals 



By (63) the integration on the left hand side of (199) can be split into one over E(^') 
followed by one over 5'. If (197) is used in the H(s) integration, this gives 

Dx= [ ds [ T'^!^ (e„/3(s, x) + x)) (220) 

Jl J^{s) 

where, as in Sect. 8, is any vector field such that = 1 with r{x) being 

the scalar function such that r{x) = 5' if v G T>(s). Now the forms obtained in the 
previous section for the two terms of the right hand side of (199) no longer involve 
Ac^. It should therefore be possible to eliminate it also from (220). To do so, put 

= Ao;(r(v), v). (221) 



Then 



V/3Z^a = + VpT 



'd\a 



ds 



{s=t{x)) 



( 222 ) 



If this is used in (220) then the term with can be converted back to an integration 

over all space and integrated by parts, when it vanishes by the covariant conserva- 
tion equation (3). Also dE^ = dE/^ since and dE/^ are parallel and 

= 1. With this, (220) becomes 



This does not yet eliminate Ac^ but it will now be shown that dXa/ds can be 
evaluated on E ( 5 ') from on that hypersurface, taken together with suitable initial 
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conditions. Since Lpa( 3 {x) is independent of 5*, differentiation of (197) with respect 
to 5* shows that 



d^a(3 ^ 

ds ds 



(224) 



This has the same form as (65), so by analogy with (66), 



du^ 




+ 



ds 






ds 



(225) 



holds along all geodesics through z(s). The initial conditions follow by differentiating 
(71) with respect to 5’. As A^(^’, z(^’)) has two dependencies on 5’, this gives 



^ = 0 
OS 

and 

aXx = z(s), which with (71) and (106) can be put in the form 




where 

= -v^ipi^x(z{s)), b,^x(s) = -2vf^V[^ipx]iiiz(s)). 



(226) 



(227) 



(228) 



(229) 



This appears at first not to meet the need as de^^jds cannot be evaluated from 
on E (s) alone. However, a^de^plds is determined by on E (s) as is seen by 
differentiating (213) and this enables the full right hand side of (225) to be evaluated. 
So dXa/ds can be determined on E ( 5 ') from an equation that only requires knowing 
x) on this hypersurface, together with initial conditions that only require 
cpi^x and 'V/^ipxn at z(s), all of which occur also in the two parts (203) and (206) of 
the right hand side of (199). 

To match, for its left hand side, the separation of terms made by (203) and (206) 
for the right hand side of (199), it is necessary to split the calculated dX^/ds into 
two parts according to the data on which they depend. Let 2 pa(s,x) he the solution 
of (225) corresponding to initial conditions (228) with < 2 /^ = 0 and b,^x = 0- This is 
determined by e^pis, x) alone. The difference between this and dX^/ds is a solution 
of (225) with the right hand side set to zero, i.e., a solution of the geodesic deviation 
equation determined by the initial conditions (228) with (229). That solution is given 
by (167). Putting both parts back into (223) gives 
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I Dx = l^ds(^j (r^>^eai3W^ - dE^ 

+ p’^v^ipnx + 



where 



and 



M^^E^x] = f {T'^'^eapw'^ - dS^ 

J i/ S (5 ) 

M"^[V*(«Aa)] =Jds (^p^v^if^x + 



while (203) and (205) show that (232) itself holds if 

[ dE^, = 

iS(5) ^ ^ 

J DkJ‘'^^^'^{s,k)E^^XKis,k). 



(230) 



p^{s)=[ X^''7’“^dE^, S^^{s) = 2[ d'Z/s. (231) 

Js(5) Js(5) 

It follows that the defining equation (199) holds if, separately, 



(232) 



(233) 



(234) 



This latter equation is the true equivalent for the revised moment definitions of (64) 
for the original ones. Both sides depend on only through its value on 

Ti(s). As seen in principle in Sect. 8, quadrupole and higher moments J therefore 
exist that satisfy (234) and expressions can be deduced for them as explicit integrals 
of over S(5'). Moreover, comparison of (233) with (206) shows that they have 
identical forms, so that (231) gives possible explicit integral expressions for the 
momentum and spin 5'^^ constructed from the monopole and dipole moments by 
(133) and (134). 

This shows that moments exist which satisfy the defining equation (199) and 
that explicit expressions can be obtained for all of them as integrals of the energy- 
momentum tensor. It does not, however, prove their uniqueness. Such a proof requires 
(232)-(234) to be deduced from (199) rather than being seen as just one possibility. 
The barrier to this deduction is that the expressions so far obtained for the two 
sides of (199) assume that (fa[3 has been chosen arbitrarily and that ea^^yS has been 
constructed from it. This is appropriate for the form (80) of the defining equation, but 
in (199) with (230) it is eap^ysi^, x) rather than that serves as test function in the 
functionals concerned. A proof of uniqueness therefore requires it to be eap^dis, x) 
that is freely specifiable on each Ti(s), or equivalently ec^p^si^M^ to be freely 
specifiable on M. 
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This will be taken up again in Sect. 23 and explicit integrals for the higher moments 
will be derived in Sect. 24, but there is one remaining aspect of uniqueness that can be 
dealt with now. The momentum and spin variables in (233) and (206) were identified 
with one another merely by similarity of form. The methods of Sect. 12 will now be 
used to prove that they are indeed the same. With the notation of (1 16) the projection 
procedure of Sect. 12 shows that 



^ 0 (235) 

where barred variables denote the difference between the two versions of the corre- 
sponding unbarred ones. This shows that = 0. As is antisymmetric it 

therefore vanishes with a projection by on either index, and for the same reason 
also with a projection by on both indices. It follows that = 0 so the two 
versions of 5'^^ are equal. It then follows immediately that the two versions of 
are also equal. 

The two-point tensors and both reduce to the unit tensor in a Minkowskian 

coordinate system in fiat spacetime. The expressions (231) for momentum and spin 
therefore agree with the corresponding quantities defined by (26) and (27) in such a 
spacetime. This adds yet further support to their identification as the momentum and 
spin of the body. 

Note that for a given world line L, the values of p'^ and depend, through the 
construction of E (i-), on the choice of vector field n'^ along L. This is so even though 
this field enters the moment definitions only through orthogonality relations for the 
octopole and higher moments. This illustrates how sensitive the values of even the 
lower moments are to details of the full set of moment definitions. 



20 Conserved Quantities 

There is one crucial difference for the momentum and spin integrals between the 
situations in fiat and curved spacetime. In a fiat spacetime, where there are no grav- 
itational forces, they were defined with respect to a spacelike hypersurface E and a 
point z not necessarily on E but were then shown to be independent of the choice of 
E. In physical terms they are conserved quantities. The definitions in curved space- 
time can similarly be made with any such E replacing the specific E(^') of (231), 
but then they do depend on the choice of E . 

It will now be shown that whenever the spacetime admits a symmetry, there 
is a corresponding linear combination of components of the momentum and spin 
integrals that does have this independence from the choice of E . Flat spacetime is 
simply a special case in which there are as many symmetries, namely ten, as there 
are components of momentum and spin, so each component is conserved separately. 

A spacetime symmetry corresponds to the existence of a Killing vector field uo^. 
By (167) its value at any point v is given in terms of its value, and that of its first 
derivative, at any other point z by 
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(236) 

Hence 

^ r“/^w«dS;3 = p^LO^ + i S'^^V^LOx (237) 

for all points z and all spacelike hypersurfaces E that extend through the body. Here 
E is to be used instead of the prescribed E(5') in the integrals (231) for p'^ and 5^^. 

The difference between the value of the left hand side for two different choices of 
E can be converted to a volume integral that vanishes in virtue of the Killing equation 
and the covariant conservation equation for . Its value is therefore independent 
of the hypersurface E of integration, so this is true also of the right hand side. 
This is therefore a linear combination of components of momentum and spin that is 
conserved in virtue of the spacetime symmetry, as required. For any choice of z on 
L, E can be taken to be the E(^') used in (231). This linear combination is therefore 
constant along L when taken with the precise expressions (231) that occur in the 
equations of motion, which is perhaps a more natural interpretation of a conserved 
quantity. 

An alternative view of this conservation property comes from the motion of z 
along L. The right hand side of (237) is constant as z moves, and differentiation of 
it with respect to 5’ shows with (135) and (136) that 

F^uj^ + i = 0. (238) 

Each spacetime symmetry therefore leads to a component of the force and torque 
vanishing. 

These results may seem so obvious as to be hardly worthy of mention. They are 
not really obvious, though, they are merely expected. Their truth with the definitions 
of momentum, spin, force and torque given here provides a sensitive test of those 
definitions. By way of illustration, it may have seemed more in keeping with New- 
tonian mechanics and special relativity to have defined the force by 5 p'^/ds = 
so treating the “spin-curvature interaction” term in (135) as a contribution to the 
force. Had this been done, a spacetime symmetry would no longer have led to the 
vanishing of a component of force and torque. 

Now consider the expressions (231) in a fiat spacetime. This has ten linearly 
independent Killing vector fields, so the force and torque vanish identically. Also, 
for a fixed point z, the values of and are independent of the hypersurface of 
integration and therefore, for a given z, independent of the vector that determines 
the hypersurface used in (231). By letting z vary they can therefore be considered as 
well defined fields on the spacetime manifold. 

This is all true also for a spacetime of constant curvature, which provides another 
useful testbed for the definitions. In such a spacetime the curvature tensor has the 
form 



^a/3j6 — [55 9 ad 9 [3^^ 



(239) 
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where k is constant, so (135) and (136) reduce to 



= kv^S^x, v>^Vij,Sk.x = 2v>^p[f,gx]ij.. 
Since the choice of L is so far unspecified, this holds for all i;^, giving 



(240) 



^XPk. — kSi^x, — 2p[i^gx]i_i- 



(241) 



In a fiat spacetime ^ = 0. In a Minkowskian coordinate system the first of these 
shows that is constant. The second is then trivially integrated to show that depen- 
dence of on z has the form (34). When k is nonzero, (241) shows that V(a p^) = 0 
and hence that is a Killing vector field. Given this field, the first of (241) deter- 
mines the field 5'^^ which then satisfies the second equation as a result of (239). It 
follows that any Killing vector field determines an which, taken together, 
satisfy (240) and therefore the equations of motion (135) and (136) along any choice 
of world line L. So as in the case of fiat spacetime, and can take any values 
at a single point z. The first of (241) gives ^xPk at that point, (167) then gives p,^ 
everywhere from which is determined everywhere by a second use of (241). 



21 Centre of Mass 

Although they have been referred to frequently as equations of motion, (135) and 
(136) are in fact only evolution equations. They describe how and 5'^^ vary as 
the point z moves along an arbitrarily chosen timelike world line L. They become 
equations that describe the motion of the body only when a supplementary condition 
is given that determines L uniquely, preferably in a way that enables it to be consid- 
ered as the world line of the centre of mass of the body. The problem of uniqueness 
was the final issue identified in Sect. 4 as arising from Mathisson’s paper of 1937. 

Section 5 identified a pointer towards the type of supplementary condition that 
would determine L uniquely. This was to use pxS'^^ = 0 in place of Mathisson’s 
choice vxS'^^ = 0 that seems natural but which fails the uniqueness test. The big 
question this left unanswered was what definitions to use for momentum p^ and 
spin 5'^^ as they are not directly tensors within the moment set defined by (1) or its 
successors. Suitable quantities may be identified as constructs from the moments at 
any level of multipole approximation. These may provide uniqueness in that approx- 
imation but they do not provide a satisfactory definition of centre of mass, for two 
reasons. One is that there is an element of arbitrariness in the choice of constructs, the 
other is that the definition will change when higher multipole moments are retained. 

It was seen in Sect. 13 that when the variational equation is solved exactly, and 

have forms given by (133) and (134) that give precise definitions of p'^ and 
within the moment scheme. Exact and uniquely determined integral expressions (231) 
were found for these in Sect. 19 where it was seen that in a fiat spacetime they reduce 
to the generally accepted definitions (26) and (27) of momentum and spin in special 
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relativity. They are therefore the natural candidates to adopt in a supplementary 
condition p\S'^^ = 0. But making this precise is not entirely straightforward. The 
tensors and can be defined by (231) with respect to any base point z and 
timelike unit vector at z by taking, as before, T>(s) to be formed from all geodesics 
through z orthogonal to but both tensors depend on both z and 

A two-stage process is therefore needed. First consider as a function of z and 
the vector and for any z choose at z so that and n'^ are parallel, i.e., 

n^'^p^\z,n) =0. (242) 

This is an implicit equation that can be solved by iteration. With any initial choice 
take to be the unit vector parallel to p^(z, ^(/)). It is plausible that 
this process converges, at least for weak gravitational fields, as only one iteration 
is required in fiat spacetime. Then with this choice of ^^(z), and 5'^^ can be 
considered as functions of z alone. The line L can now be defined as the locus of 
points that satisfy 



px{z)S^\z) = 0. (243) 

Again it is plausible that this determines a unique L, at least for weak gravita- 
tional fields. 

A set of sufficient conditions for this to be so has been given by Schattner [36, 
37] where uniqueness is proved by use of a fixed point theorem for mappings. A 
definition with a two-stage process of this nature for an extended body was first 
given by the present author in [22] with different expressions for and 5^^, and 
with the present expressions by him in [27]. 

Support for this definition is provided by a test body in a spacetime of constant 
curvature. It was seen in the previous section that in this case p'^ and are well 
defined functions of z that are independent of the choice of This makes the 
solution of (242) be trivial. As in fiat spacetime, only one iteration is required. By 
(241), (243) is equivalent to 



p^Vxp'' = 0. (244) 

It follows, with (241), that p^’V^ip^'Vxp'^) = 0 wherever (244) holds, so if (244) 
holds at a point then it holds also along the orbit of p^ through that point. This orbit 
is therefore a solution for the desired world line L as (243) holds everywhere along 
it. By (244) that orbit is also a geodesic. It is the solution for L provided the solution 
is unique, a point which will be returned to in the next section where it will be seen 
that there is an exceptional case where uniqueness fails. This case corresponds to 
circumstances that cannot be physically realised, but its existence shows the need 
for conditions such as those of Schattner in order for uniqueness to be proved. 

For a test body in a spacetime of constant curvature, the above definitions therefore 
lead to the world line of the centre of mass being a geodesic. Again this is to be 
expected since the force and torque have already been seen to vanish, but its truth is 
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a sensitive test of the definition of centre of mass. Note also that is parallel to 
so that also = 0 on L. The difference between the two conditions, however, 

is that the second one does not determine a unique world line but the first one does 
so, other than in physically unrealistic circumstances. 



22 The Momentum- Velocity Relation 



With L chosen as the world line of the centre of mass, the velocity = dz'^/ds 
becomes determined by the motion of the body. It therefore becomes meaningful to 
ask how it is related to the momentum p^. The occurrence of in (135) and (136) 
is sufficiently involved that it is remarkable that an explicit solution can be found. 
Nevertheless there is such a solution, due to Ehlers and Rudolph [38]. 

Recall first that the entire development so far has left the choice of parameter 5' on 
L arbitrary, so is not necessarily a unit vector. Changing the parametrisation does 
not affect the form of the equations of motion (135) and (136), it only changes the 
force and torque F'^ and by a scale factor. Several aspects of the development 
from these equations are simplified by choosing 5' not to be proper time along L but 
instead such that 



= 1. (245) 

This corresponds to choosing d^-, at each point of L, to be the elapsed time interval 
in the frame in which the 3-momentum is zero, even though the mass centre itself is 
moving in this frame. With this choice can be called the kinematical velocity and 
the dynamical velocity of the body, and it will be adopted in this section only. 
With (242) in mind, define the mass M(^') of the body by 

so that = Mn'^. (246) 

Note that h{p\S'^^)/ds = 0 since (243) holds along L. Expand the derivative and 
use (135) and (136) to give 



= S'^^Wx (247) 

where 

+ M~^ L'^^X (248) 

and 

Wx = Fx+ ^^vPS^^‘'Rxpi,r.. (249) 

Now eliminate n” between (247) and (249) and multiply the result by to give 



S^Xwx = S^^iFx + \hf>S^''Rxppr.) + \M~^S'^^^Sf’^<^Sl^^WaRxppv. 



(250) 
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It follows from (243) that 5^^ has matrix rank at most 2, since it cannot have rank 3 in 
virtue of its antisymmetry. Hence = 0, from which 

If this is used in (250), it gives a linear equation for S’^^W\. When the solution is 
substituted into (247) it yields 

(m2 + - h^) = S'^\Fx + (251) 

which is the required result. The solution for satisfies the normalisation condition 
(245) since n,^h^ = 1, = 0. 

It was seen in Sect. 20 that and both vanish for a test body in a spacetime of 

constant curvature, where the curvature tensor has the form (239). The right hand side 
of (251) is then zero in virtue of (243). It follows that = n'^ unless + kS^ = 0, 
where S is the magnitude of the spin defined by = (S'^^Si^x)/2. In this exceptional 

case (251) is satisfied identically so v'^ cannot be determined. 

This confirms the result in the previous section that in such a spacetime the centre 
of mass follows a geodesic path with just one exception, the case kS^ = 0. 

This requires the spacetime to be of constant negative curvature, of radius R where 
k = —R~^, and the body to be such that S = MR. Such a body has a size comparable 
with the radius of the universe, rotating at a speed comparable with that of light, so 
it is not physically realisable. As a mathematical case, however, it does genuinely 
give rise to non-uniqueness of the world line of the centre of mass as defined in the 
previous section. It was shown in [27] that in this case the points satisfying (242) and 
(243) form a timelike two-dimensional surface. The world line L can therefore be any 
timelike line in this surface, so really is indeterminate. This surface also contains a 

two-parameter family of timelike geodesics so the conclusion of the previous section 
that there are geodesic solutions was also correct, even in this exceptional case. 



23 Uniqueness of the Moments 



It was seen in Sect. 19 that to draw conclusions from the form (199) of the moment 
defining equation, needs to be freely specifiable on H(s) for each 5'. 

This is needed in order to deduce (232)-(234) and then to extract unique integral 
expressions for the higher moments from the last of these. This issue was left open in 
Sect. 19. It will now be shown that for any choice of there exists a (pa^(x) 

consistent with this choice, i.e., such that the x) constructed from (fapM by 

(197) and that constructed from es^^ai^, x) by (205) are equal on each Ti(s). With 
this in hand, the uniqueness proof will then be completed. 

The proof of existence is by construction. Let variables chosen ab initio and others 
used in the construction of be marked with hats and those constructed from it be 
unhatted, as above. The requirement is to prove that the relevant hatted and unhatted 
variables are equal on the hypersurfaces ^(s). So assume now that a tensor field 
x) is freely chosen on each subject only to symmetry properties 
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e5-f(3a = ^^[7/3o] = 0- (252) 

Take the parameters and h^\ to be zero. For each 5', extend x) smoothly 

away from Ti(s) and set 



e/3a(s, x) = a^a"*e/3Sa-y(s, x). 



(253) 



Then epa is symmetric, 

e0c^(s,x) = O, x) = 0, ~^^(s,x) = 0 3 Xx=z(s) (254) 

os 

and a^epa(s,x) = 0 identically. (255) 

Solve the hatted (225) with (228) along all geodesics through z(s) to give dXa/ds 
for V both on and off S ( 5 ') . With initial condition Xa{s,x) = 0 for v G Ti(s), integrate 
the calculated dXa/ds to give Xa(s, x) for all 5' and x. By (221), this construction 
gives 0a(x) = 0 for all x and since a,^ = 0 and b,^x = 0, (228) with (222) and its 
derivative give 



V/3Ack( 5', x) = 0, V^/3 Ack( 5', x) = 0 atx = z(5'). 



Define 



(Pap(s, X) = eap(s, X) + V(c,A/3)(^, x) 



which from (254) and (256) satisfies 

(fa^is, x) = 0 and x) = 0 atx = z(s). 

Then the required ip^p is given by 

= 0a(5{r{x),x) 



from which 



VaP^P^ix) 



Vc(^/37(t(x),x) + V^r 



ds 



(r(x),x). 



It follows from (257) that 



d^a(3 

ds 



{s,x) 



d^aj3 



( 5 ’, x) + V(CK 



ds 



{s,x). 



(256) 

(257) 

(258) 

(259) 

(260) 
(261) 



Both terms on the right hand side vanish when x = z(z), the first by (254) and the 
second by (228) since h^\ = 0. So 
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— ^^(5’,x) = 0 atx = z(5’) (262) 

os 

which with (258) to (260) gives 

= 0 and = 0 atx = z(5’). (263) 

It follows from (229) that a,^ = 0 and = 0 in agreement with the assumed values 
for their hatted counterparts. 

There is now a complete parallel with (197), and with (66) as applied to Ac^, under 
the correspondence 






9^aP 

ds 






d^aP 



dXg 

ds 



(264) 



with the conditions (207) and (262) also corresponding. The cpap of (66) admittedly 
does not depend on 5' while its corresponding quantity does so, but 5' is not varied 
in (66) so this is immaterial. The steps leading to (213) remain valid under this 
correspondence, giving 



(7^ _ Q X 

OS 



(265) 



which with (260) shows that 

x) fovx e S(5’). (266) 

Differentiation of (255) shows that 



= 0. (267) 

This is the hatted version of (208), which was there seen to be equivalent to (66) for 
Ack, and (257) is the hatted version of (197). It follows that Xa satisfies (66) with (f^p 
replacing cpap- But unhatted Ac^ is constructed by solving (66) with unhatted cpap 
along all geodesics through z(s). The hatted and unhatted right hand sides of (66) 
are equal on H(s) by (266), so Ac^ and Ac^ satisfy exactly the same equation along 
all geodesics in H(s) through z(s). The initial conditions are the same, too, since 
Xi^(s,z(s)) and V \X,^(s , z(s)) both vanish, the first by construction and the second 
by (256), and their unhatted counterparts vanish by (71) with (263). The solutions 
are therefore identical on E(^') so that 



\^{s,x) = Ack(5',a:) forx G Ti(s) (268) 

or equivalently, by (221), UgM = for all v. By construction i>a(x) = 0 but 

this equality would have followed if any other value had been chosen such that it and 
its first derivative vanishes on L. 
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What remains is to show that VpXa(s,x) and V^Aq.( 5', x) are also equal on H(s), 
for then it will follow from (197), (257) and (259) that eap(s,x) and ea^(s,x) 
are equal on T>(s), as required. Since Ua = (222) shows that 'V^Xa(s,x) = 

V/3Ack(^, x ) on Ti(s) if dXa(s, x)/ds = dXa(s, x)/ds there. That these are equal at 
X = z(s) already follows since a,^ = a,^ = 0. 

For simplicity let barred variables denote the difference between corresponding 
hatted and unhatted ones, e.g., (pa^{s, x) = x) — ^a^{x). The notation will 

be as in Sect. 18 from (207) onward. Since is independent of 5', the unhatted 

version of (265) holds trivially, so 



Qd(pa(}{s,x) „ . 

= 0 for all X. 

as 

The barred version of (257) then gives 




along all geodesics through z{s). But (213) and (255) give 

x) = 0 for all x 



(269) 



(270) 



(271) 



from which U^U^dea[ 3 /ds = 0, so multiplying (270) by shows that U^dXa/ds 
is constant along all geodesics through z(s). This value is zero at x = z(s) as noted 
above, so 

dX 

cr^^=0 for all X. (272) 

os 

If (271) is differentiated with respect to 5', the result with (272) enables (270) to be 
put in the form 



= 2u ^v\g‘^^g^^ 6 ^( 5 - (273) 

The occurrence of 6^(3 can be eliminated for geodesics in E (s). The left hand side 
of (222) is zero when applied to the barred variables and (pa[ 3 = 0 for x G T>(s) 
by construction, so 

ea /3 = V(„r^ forxeE(s). (274) 

If (273) is multiplied by v,^ and restricted to geodesics in E (s) then use of (274) turns 
it into a homogeneous equation in the variable v,^G'^^dXa/ds. This variable is zero 
at X = z(5') as dXa/ds = 0 at this point, so it is zero throughout E(^'). This can be 
put back into the right hand side of (273) itself to turn it on E (^') into a homogeneous 
equation for a'^^dXa/ds, a function of x whose value is a vector at z. Once again 



d 

du 






dXg 

ds 
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this vanishes at x = and so this vector, and hence dXa/ds itself, vanishes for 
all V G 

As discussed above, this completes the proof that eap(s,x) and eap(s,x) are 
equal on each T>(s). Because the constructed ipa (3 satisfies (263), (199) reduces to 
(232) due to (206) and (230). This justifies (232) for all and so in particular for 
those constructed from a ipa(3 that does not satisfy (263). Subtraction of (232) from 
the full defining equation (199) then shows that (233) is also valid. 

Consider now a free choice of es^pair{x), x) multiplied by a function that is 
constant on each T>(s). This multiplier can be brought outside the ^-space integral 
of (203) and the hypersurface integral of (232) as both have been shown to depend 
on the value of e^pis, x) only through its value on These are both expressions 
for so as the multiplier is arbitrary, the 5' -integrands must be equal. This 

proves (234) and completes the proof of uniqueness for the results of Sect. 19. 



24 Higher Moments as Integrals 



The one issue standing in the way of extracting integral expressions for the moments 
J from (234) is the function This is defined as the solution of (225) corre- 
sponding to initial conditions (228) with a,^ = 0 and b,^x = 0. In practice it is easier 
to work with an alternative specification. The barred variables of (273) also satisfy 
(225) since the hatted and unhatted ones do so separately, and = 0. Equation 

(273) is in fact a first integral of (225) in the case a,^ = 0, so also satisfies it, 

giving 

— = 2u~^vxcr'^^cr^^cr^cr^ea^ps (275) 

along all affinely parameterised geodesics x(u) through z(s) with z{s) = x(0). 
Indices a, (3,^, .. . here refer to the generic point x(u). Note that since b,^x = 0. 
'ipaixiu)) = 0(u^) as w ^ 0, so the singularity at w = 0 is only apparent. 

Now the Fourier inversion formula with (150) and (152) gives 

x) = j k) (276) 

where k ■ a = k\a^. Since a'^(x(u)) = ua'^(x{l)) for the affinely parameterised 
geodesic x(u), (276) can be put into (275) and the equation integrated to give 



1pa(s,x) = 



(2:it)4' 



/ OO 

19 ^ Ex^iiyp ^ ^ 

n 
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where 
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a^p^a^^u^ du 



(278) 
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with the arguments of the cr’s in (278) being (z,x(u)). The 0’s are propagators 
dependent solely on the geometry and such that 

(279) 



in a flat spacetime for all n. 

If (276) and (277) are put into (234), the result may be compared with (204) to 
give 

= (- 1 )” [ 

J'Eis) \ 

+ ^— (280) 
n + 1 / 

Here, as everywhere, n is the number of indices in the set marked by dots, here 
cr • • • p, and nested square brackets for antisymmetrisation are taken in order from 
left to right, so the pairs z//i and Xk are antisymmetrised. The factor (— l)'^ arises 
from the minus sign in (150); in a flat spacetime is minus the position vector from 
z(s) to the point v on Ti(s). 

This is the flnal expression for the quadrupole and higher moments as integrals 
of the energy-momentum tensor. They satisfy the symmetry conditions (94) and 
(95) identically and the orthogonality conditions (99) as n,^(s)a'^(z(s), x) = 0 for 
X e Ti(s) by the deflnition of this hypersurface. 

Mathisson took 5' as proper time along L and his orthogonality conditions used 
treatment of the quadrupole moment he neglected components that 
have no classical analogue. In doing so he is effectively considering an energy- 
momentum tensor of the form where /i(x) is a mass density. Neglect 

of octopole and higher moments means that all tensors appearing in the expression 
for the quadrupole can be taken as evaluated at z(^’), so that effectively also 
and the distinction between indices at v and z can be ignored. If these assumptions 
are put into (280) for the case ^ = 0 it gives 

jupXK ^ ( 281 ) 

where cIEq, in agreement with Mathisson’s expression (22). 

This is the last piece in the jigsaw required to compare the quadrupole torque 
derived here with that of Mathisson. The required comparison is between his result 
(23) and the form (138) of the equation of motion for spin. The torque in the 
quadrupole approximation is given by (188) with (189). The Veblen extension 
appearing in this is given by (196) and if is taken as given by (281) then it is seen 

that Mathisson’s result and ours are in agreement. It was noted in Sect. 6 that although 
Taub [23] and Madore [24] also developed quadrupole approximations, their work 
had algebraic errors. When those are corrected it is found that their expressions agree 
with one another and that both the force and torque of their results agrees with the 
quadrupole approximation obtained here. 
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25 Summary and Discussion 



The multipole approach to equations of motion in general relativity originated with 
an ambitious paper by Myron Mathisson in 1937 entitled, in translation, “A New 
Mechanics of Material Systems”. This paper laid down a general program but carried 
through the details only in an approximation in which only monopole and dipole 
moments are retained, with a partial treatment also of the quadrupole moment. 

He continued to work on this topic until his untimely death in 1940, after which 
this subject went dormant until Mathisson ’s results were rediscovered, by a very 
different and much less satisfactory method, by Papapetrou in 1951. Over the next 
twenty years various authors, including myself, attempted to improve and extend the 
methods of Mathisson and Papapetrou but little advance was made. In particular the 
quadrupole case remained stubbornly intractable. 

Further progress became possible only with the realisation that the supposed 
improvements of those twenty years were in fact not improvements at all, that they 
were leading to dead ends. Mathisson alone had been on the right track. There were 
important issues left unresolved by his work but the way forward was to identify 
these and find ways to handle them, not to go blindly off in other directions in the 
hope that they would prove fruitful. With this realisation, I was able to resolve those 
issues and take Mathisson’s program to completion. The results were published in 
three papers [25, 27, 28] over the period 1970-1974. 

This paper began by summarising Mathisson’s work of 1937, identifying the 
issues it leaves and outlining the work of Papapetrou and the subsequent attempts 
to improve on his and Mathisson’s approaches. It then presented my developments 
of 1970-1974 by a route that follows Mathisson’s program as closely as possible. 
This differs somewhat from the way that it was presented in the original papers and 
in my opinion is a substantial improvement, being better motivated and easier to 
understand. This is yet one more illustration of the foresight that Mathisson had, 
right from the start of his work. 

The results from the completion of Mathisson’s program show that an extended 
body can be described completely by an infinite set of moment tensors consisting of 
a momentum vector an antisymmetric spin tensor together with quadrupole 
and higher moment tensors. These are defined relative to a timelike world line L and 
a field of timelike unit vectors chosen along L that enters the moment definitions 
through orthogonality relations satisfied by the octopole and higher moments. 

This description is equivalent to the continuum description by a symmetric energy- 
momentum tensor in that either description completely determines the other. 
The moments initially have an implicit definition via an equation that expresses 
in terms of them. The development following Mathisson’s program then leads to 
explicit expressions for them as integrals of over spacelike cross-sections of 
the body. The hypersurfaces of integration are uniquely determined. For any point 
z on L the corresponding hypersurface is that formed by all geodesics through L 
orthogonal to n^. This shows that the set of moment definitions must be taken in its 
entirety for any of them, even momentum and spin, to become well defined. 
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The energy-momentum tensor satisfies the covariant conservation equation 

= 0. (282) 
The corresponding property for the moments is the two equations of motion 

^ (283) 

and 

(284) 

d^ 

where and are the force and torque exerted on the body by the gravita- 
tional field. These are determined by the quadrupole and higher moments. The sign 
convention for the curvature tensor is given by (2). 

There are no other restrictions on the moments. They satisfy (283) and (284) as a 
consequence of (282) but conversely, given a set of moments that satisfy (283) and 
(284) then the energy-momentum tensor that they determine necessarily satisfies 
(282). 

These results hold for an arbitrary choice of L and vector field along it. For 
such a choice, (283) and (284) are really only evolution equations rather than equa- 
tions of motion. They become true equations of motion only when supplemented by 
conditions that determine L as the world line of a uniquely defined centre of mass 
of the body. The conditions to achieve this are 



o 

II 




(285) 


and 

= 0. 




(286) 


Note that (285) cannot be replaced by a simple statement to choose as the unit 

vector parallel to since itself depends on through the hypersurfaces of 

integration, as discussed above. 

The 2^^-pole moment for ^ > 2 is described by a tensor J with {n -h 2) indices 
that satisfies the symmetry and orthogonality conditions 


jC--e6-fpa _ j(C--e)[6-f][Pa] jC--e6[-fpa] _ q 


n >2, 


(287) 


jC-[eSjWa ^ Q ^ ^ 3 




(288) 


and 

for„>3. 




(289) 
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This shows, in particular, that it has the symmetries of the curvature tensor on their 
last four indices. Note that n here continues to refer to the tensor being that for the 
2^^-pole moment. Elsewhere in the paper it is the number of indices in the set elided 
by dots, here - e, namely {n — 2) in the present context which is why the range 
of n appears to differ from that in (94), (95) and (99) from where these results are 
taken. As a result of these conditions the 2^^-pole moment tensor has (n + 2) (3^ — 1) 
linearly independent components. For the quadrupole this gives 20, which can be 
checked as it has the same symmetries as the curvature tensor for which this count 
is well known. 

Explicit expressions as integrals of are developed for the momentum and spin 
in Sect. 19 and for the quadrupole and higher moments in Sect. 24. The gravitational 
force and torque are derived in an exact form in Sect. 15 and in any desired multipole 
approximation in Sect. 16. In the quadrupole approximation these give 

Fa = \j^'^^"^aRp^5e and | . (290) 

The completion of Mathis son’s program is not the end of the story. The equations 
of motion in their exact form are satisfied exactly by an extended body but the 
gravitational field that occurs in it includes that produced by the body itself. Even for 
bodies of modest size, this self field dominates the external field in the neighbourhood 
of the body. This is certainly true for the Earth but is also for much smaller bodies. The 
multipole approximation depends on the size of the body being small in comparison 
with the length scale of the gravitational field, but the self field means that this is not 
normally true. 

What is normally done is to ignore the self field by treating the body of interest 
as a test body, one whose size or mass is such that its contribution to the field can 
be neglected. There are two problems with treating a real body in this way. One is 
that it is assuming the field equations to be linear, in some suitable sense, so that the 
self field can be subtracted to leave an external field to be used in the equations of 
motion. The other is that it is assuming that the self field does not affect the motion, 
that the self force vanishes. This is ignoring such effects as radiation reaction. 

What needs to be done is to give a precise meaning to the self field and the 
subtraction process, and so to the external field, in such a way that (282) still holds 
for the energy-momentum tensor of the body in the external field. The external field 
will not satisfy the Einstein field equations inside the body but this is not relevant to the 
results of the Mathisson program. They depend solely on the conservation equation 
(282). This problem will be addressed by others elsewhere in these proceedings. 
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Multipolar Test Body Equations of Motion 
in Generalized Gravity Theories 



Yuri N. Obukhov and Dirk Puetzfeld 



Abstract We give an overview of the derivation of multipolar equations of motion 
of extended test bodies for a wide set of gravitational theories beyond the standard 
general relativistic framework. The classes of theories covered range from simple 
generalizations of General Relativity, e.g., encompassing additional scalar fields, to 
theories with additional geometrical structures which are needed for the description 
of microstructured matter. Our unified framework even allows to handle theories 
with nonminimal coupling to matter, and thereby for a systematic test of a very 
broad range of gravitational theories. 



1 Introduction 

In this work we present a general unified multipolar framework, which enables us to 
derive equations of motion of extended test bodies for a wide range of gravitational 
theories. The framework presented here can be applied to theories which signifi- 
cantly go beyond General Relativity (GR), and range from the most straightforward 
extensions of GR, like scalar- tensor theories, to theories with additional geometrical 
structures and nonminimal coupling. 

The multipolar method which we employ here can be thought of as the direct 
generalization of the ideas pioneered by Mathisson [1], Papapetrou [2], and Dixon 
[3-6] to the case of generalized gravity theories. As sketched in Fig. 1, the main aim 
of such methods is to find a simplified local description of the motion of extended test 
bodies in terms of a suitable set of multipolar moments, which catches the essential 
properties of the body at the chosen order of approximation. 
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Fig. 1 General idea behind multipolar approximation schemes: The world- tube E of a body is 
replaced by a representative world-line L, whereas the original energy-momentum tensor is 
substituted by a set of multipole moments m^^' " along this world-line. Such a multipolar description 
simplifies the equations of motion. This is achieved by consideration of only a finite set of moments. 
Different fiavors of multipolar approximation schemes exist in the literature, in this work we define 
the moments a la Dixon in [3] 



In this work, we use a covariant multipolar description, based on Synge’s world- 
function formalism [7, 8]. The multipolar moments to be introduced, can be viewed 
as a direct generalization of the moments first introduced by Dixon in [3]. 

Central to the derivation of the equations of motion, by means of a multipolar 
method, is the knowledge of the corresponding conservation laws of the underlying 
gravity theory. In General Relativity, the starting point of all methods mentioned 
so-far is the conservation of the (symmetric) energy-momentum 

= ( 1 ) 

Many generalized gravity theories allow for a more detailed description of matter 
than standard GR, e.g., also taking into account internal matter degrees of freedom as 
source of the gravitational field. Consequently the conservation laws of such theories 
are more complex, and (1) has to be replaced by a suitable generalization. 

Generalized gravity A metric and connection are the two fundamental geometrical 
objects on a spacetime manifold. They play an important role in the description of 
gravitational phenomena in the framework of what can be quite generally called an 
Einsteinian approach to gravity. The principles of equivalence and general coordinate 
covariance are the cornerstones of this approach. As Einstein himself formulated, 
the crucial achievement of his theory was the elimination of the notion of inertial 
systems as preferred ones among all possible coordinate systems. 

In Einstein’s theory, gravitation is associated with the metric tensor alone. Nev- 
ertheless, it is worthwhile to stress that Einstein clearly understood the different 
physical statuses of the metric and the connection (Appendix II in [9]): 
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[. . .] at first Riemannian metric was considered the fundamental concept on which the general 
theory of relativity and thus the avoidance of the inertial system were based. Later, however, 
Levi-Civita rightly pointed out that the element of the theory that makes it possible to avoid 
the inertial system is rather the infinitesimal displacement field The metric or the 
symmetric tensor field gik which defines it is only indirectly connected with the avoidance 
of the inertial system insofar as it determines a displacement field. 

There exists a variety of gravitational theories that generalize or extend the physi- 
cal and mathematical structure of GR. Among these theories there are large classes of 
so-called f{R) models, and of theories with nonminimal coupling to matter; they are 
developed in particular in the context of relativistic cosmology (but not only there), 
see [10-14]. The so-called Palatini approach represents another class of widely dis- 
cussed theories in which the metric and the connection are treated as independent 
variables in the action principle [15-17]. Another early example of a generalized 
gravity theory with balance laws different from Einstein’s gravity can be found in 
[18, 19]. Last but not least, we should mention the vast family of the gauge gravity 
theories constructed using a Yang-Mills type of approach [20, 21]. 

Since our main aim is to present a unified multipolar framework for a very wide 
range of gravity theories, the theory underlying our analysis has to be sufficiently 
general. In this work we choose metric-affine gravity (MAG) as the foundation for 
the derivation of the equations of motion of extended deformable test bodies (Fig. 2). 



Fig. 2 In contrast to GR, MAG also allows to take into account the microstructural properties of 
matter (spin, dilation current, proper hypercharge). These additional currents couple to the post- 
Riemannian geometric degrees of freedom of the underlying spacetime. Our unified description of 
extended test bodies allows to cover a wide range of gravitational theories and matter models 
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In MAG, matter is characterized by three fundamental Noether currents — the 
canonical energy-momentum current, the canonical hypermomentum current, and the 
metrical energy-momentum current. These objects satisfy a set of conservation laws 
(or, more exactly, balance equations). In view of the multi-current characterization 
of matter in metric-affine gravity, we develop a general approach which is applicable 
to an arbitrary set of conservation laws for any number of currents. The latter can 
include gravitational, electromagnetic, and other physical currents if they are relevant 
to the model under consideration. In particular, our approach is flexible enough to be 
applied to the case in which there is a general nonminimal coupling between gravity 
to matter. Our presentation here is mainly based on the original works [22-24]. 

Structure of the paper The structure of the paper is as follows: In Sect. 2 we give an 
overview of the metric-affine theory of gravity. In particular we cover its geometrical 
and dynamical aspects. This is followed by a general Lagrange-Noether analysis in 
Sect. 3. The results of this general analysis are then used to derive the conservation 
laws of a metric-affine gravity with nonminimal coupling in Sect. 4. In Sect. 5 a unified 
multipolar framework is presented. This framework is then used Sect. 6 to derive the 
general equations of motion of extended test bodies in nonminimal metric-affine 
gravity. Several specializations to other gravity theories are discussed. Furthermore, 
in Sect. 7 the equations of motion in scalar-tensor theories are worked out. Section 8 
contains a summary of our results. 

Our conventions are those of [25]. In particular, the basic geometrical quantities 
such as the curvature, torsion, and nonmetricity are defined as in [25], and we use 
the Latin alphabet to label the spacetime coordinate indices. Furthermore, the metric 
has the signature (-h, It should be noted that our definition of the metrical 

energy-momentum tensor differs by a sign from the definition used in [12, 14, 26]. 
A summary of our notation and conventions, as well as additional material for com- 
parison to previous works can be found in the appendix. 



2 Metric- Affine Gravity Primer 

Metric-affine gravity [25] is a natural extension of Einstein’s general relativity the- 
ory. It is based on gauge-theoretic principles [20, 21], and it takes into account 
microstructural properties of matter (spin, dilation current, proper hypercharge) as 
possible physical sources of the gravitational field on an equal footing with macro- 
scopic properties (energy and momentum) of matter. The formalism of MAG makes 
it possible to study all the above-mentioned alternative theories in a unified frame- 
work. The corresponding spacetime landscape [27] includes as special cases the 
geometries of Riemann, Riemann-Cartan, Weyl, Weitzenbock, etc. (cf. Fig. 3). 

The standard understanding of metric-affine gravity is based on the gauge- 
theoretic approach for the general affine symmetry group. In this framework, we 
can naturally distinguish the kinematics of the gravity theory and its dynamics. The 
kinematics embraces all aspects that are related to the description of the fundamental 
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Fig. 3 Different spacetime 
types as special cases of a 
general metric-affine 
spacetime {Rm^ 7 ^ 0, 

Tki^ 7^ 0, Qkij ^ 0). The 
abbreviations R (curvature), 
T (torsion), and Q 
(nonmetricity) over the 
arrows denote the vanishing 
of the corresponding 
geometrical object 




variables, their mathematical properties and physical interpretation. The dynamics 
studies the choice of the Lagrangian and the field equations. We collect in this section 
all the relevant material. Although the Lagrange-Noether machinery is deeply interre- 
lated with both kinematics and dynamics of the theory, we will discuss it in a separate 
section which will ultimately underlie the derivation of the multipole equations of 
motion. 



2.1 Kinematics of Metric-Affine Gravity 

We model spacetime as a four-dimensional smooth manifold. In this study we are 
not interested in the global (topological) aspects, and we will confine our attention 
only to local issues. The local coordinates x\ i = 0, 1, 2, 3, are introduced in the 
neighborhood of an arbitrary point of the spacetime manifold. The geometrical (grav- 
itational) and physical (material) variables are then fields of different nature (both 
tensors and nontensors) over the spacetime. They are characterized by their compo- 
nents and transformation properties under local diffeomorphisms x^ + Sx\ 

where 

6x^=^\x). ( 2 ) 

The four arbitrary functions (v) parametrize an arbitrary local diffeomorphism. 



2.1.1 Geometrical Variables 

In the metric-affine theory of gravity, the gravitational physics is encoded in two 
fields: the metric tensor gij and an independent linear connection Vki -^ . The latter is not 
necessarily symmetric and compatible with the metric. From the geometrical point 
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of view, the metric introduces lengths and angles of vectors, and thereby determines 
the distances (intervals) between points on the spacetime manifold. The connection 
introduces the notion of parallel transport and defines the covariant differentiation 
Wk of tensor fields. 

Under the spacetime diffeomorphisms (2), these geometrical variables trans- 
form as 



Sgij = -(di^'^)gkj-{djC’^)gik, (3) 

SVkiJ = - ViiJ - Vki^ + {di^) Vk/ - 4^^' . (4) 

In general, the geometry of a metric-affine manifold is exhaustively characterized 
by three tensors: the curvature, the torsion and the nonmetricity. They are defined 
[27] as follows: 



Rkii^ := OkFuJ - diVkiJ + rjru^ - riJVki\ (5) 

Tk/ :=Tk/ -Vij/, ( 6 ) 

Qkij •= “ ^k9ij = —dkQij + ^k/gij + ^k/dih ( 7 ) 

The curvature and the torsion tensors determine the commutator of the covariant 
derivatives. For a tensor di...di of arbitrary rank and index structure: 

k I 

+ Rgbe^' A^^-^-^^di...di - Rqbdj^ A^^"'^^di...e...dr (^) 

i=\ 7=1 

The Ricci tensor is introduced by Rij := Rkij^, and the curvature scalar is R := g^^Rij. 

A general metric-affine spacetime {Rm^ / 0, / 0, Qkij 7 ^ 0) incorporates 

several other spacetimes as special cases, see Fig. 3 for an overview. The Riemannian 
connection Fy^/ is uniquely determined by the conditions of vanishing torsion and 
nonmetricity which yield explicitly 

= -g^\djgki + dkgij - digkj). (9) 

Here and in the following, a tilde over a symbol denotes a Riemannian object (such 
as the curvature tensor) or a Riemannian operator (such as the covariant derivative) 
constructed from the Christoffel symbols (9). The deviation of the geometry from 
the Riemannian one is then conveniently described by the distortion tensor 

Nk/ :=fk/ -Tk/. ( 10 ) 

The system ( 6 ) and (7) allows to find the distortion tensor in terms of the torsion and 
nonmetricity. Explicitly: 
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V = - + r, + rV) + - G./ - cy ). (11) 

Conversely, one can use this to express the torsion and nonmetricity tensors in terms 
of the distortion, 



V = -2%,y, (12) 

Qkij — ( 12 ) 

Substituting (10) into (5), we find the relation between the non-Riemannian and the 
Riemannian curvature tensors 

Radc” = Radc” - yaNdc” + ^dNac” + Nan” N^c" ~ N^n” Nac"^ . (14) 

Applying the covariant derivative to (5)-(7) and antisymmetrizing, we derive the 
Bianchi identities [27]: 



'^[nRkl]i-^ = T[kl^Rn]mi\ ( 15 ) 

^[nTkl] ~ R[kln] H” Tj^]m ■> (16) 

^[nQk]ij — ^nk{ij)- ( 12 ) 



2.1.2 Tensors, Densities, and Co variant Differential Operators 

Along with tensors, an important role in physics is played by densities. A fundamental 
density is constructed from the determinant of the metric, g = detgij. Under 
diffeomorphisms (2) it transforms as 

^^9 = -m)^g. (18) 

This is a direct consequence of (3). From any tensor one can construct a 

density " . Although in this paper we will encounter only such 

objects, it is worthwhile to notice that not all densities are of this type — see the 
exhaustive presentation in the book of Synge and Schild [28]. 

There are two kinds of covariant differential operators on the spacetime manifold, 
depending on whether the connection is involved or not. The Lie derivative Cq is 
defined along any vector field Q and it maps tensors (densities) into tensors (densities) 
of the same rank. Let us recall the explicit form of the Lie derivative of the metric 
and the distortion: 



^C9ij — C^dk9ij + (diC^)gkj + (djC^)gik, 

C^Nk/ = CdnNk/ + (dkC)Nn/ + (djC)Nkn' ~ (dnC)Njg\ 



(19) 

( 20 ) 
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In contrast, a covariant derivative raises the rank of tensors (densities) and 
it is determined by the linear connection Vk/ . Moreover, there are different covari- 
ant derivatives which arise for different connections that may coexist on the same 
manifold. 

A mathematical fact is helpful in this respect: Every third rank tensor Xj^f defines 
a map of one connection into a different new connection 

rk/ + (21) 

There are important special cases of such a map. One example is obtained for Xj^f = 
Nk / : then the connection is mapped into the Riemannian Christoffel symbols, 
Tk/ = Tk/ + Nkj\ in accordance with (10). 

Another interesting case arises for Xj^f = Tjk\ The result of the mapping 

= ^k/ + Tjk^ = rk/ + - Vk/ = Vjk\ ( 22 ) 

is then called a transposed connection, or associated connection, see [29, 30]. 

The importance of the transposed connection is manifest in the following obser- 
vation. Although the Lie derivative is a covariant operator — this is not apparent since 
it is based on partial derivatives — one can make everything explicitly covariant by 
noticing that it is possible to recast (19) and (20) into equivalent form 

Cc9ij = C^^kQij + + (V.CbftVfc, (23) 

CcNk/ = CVnNk/ + i^kONn/ + i^jONkJ - (24) 

By the same token we can “covariantize” the Lie derivatives for all other tensors of 
any structure and of arbitrary rank. 

A more nontrivial (and less known) fact is that we can define the Lie deriva- 
tives also for objects which are not tensors. In particular, the Lie derivative of the 
connection then reads [29] : 



= ^k^jC - Rki/C^. (25) 

This quantity measures the noncommutativity of the Lie derivative with the covariant 
derivative 



(C^Vk - 

k I 

= - X(^crw/)V‘-"^rf,...ft...4. (26) 

/=! 7-1 

The connection , the transposed connection T y^/ , and the Riemannian con- 
nection Ty^/ define the three respective covariant derivatives: Vy^, Vy^, and Vy^. The 



Multipolar Test Body Equations of Motion in Generalized Gravity Theories 



75 



covariant derivative defined by the Riemannian connection is conventionally denoted 
by a semicolon 

We will assume that these differential operators act on tensors. In addition, we 
will need the covariant operators that act on densities. For an arbitrary tensor density 
. 7... introduce the covariant derivative 



:= dn^^iJ- + (27) 

which produces a tensor density of the same weight. We denote a similar differential 
operation constructed with the help of the Riemannian connection by 

:= (28) 

When "■ = we find 

v„ 05" (29) 

v„ 05" (30) 

where we introduced a modified covariant derivative 

Vi :=Vi+Nki'^. (31) 



2.1.3 Matter Variables 

We will not specialize the discussion of matter to any particular physical field. It will 
be more convenient to describe matter by a generalized field . The range of the 
indices A, B, ... is not important in our study. However, we do need to know the 
behavior of the matter field under spacetime diffeomorphisms (2): 

= -(5;Ch(<T^B)i'V’®. (32) 

Here (ct'^b)/ are the generators of general coordinate transformations that satisfy 
the commutation relations 

b)i^ - c)l^ b) / = {ct^b)i^ - {ct^b ) • (33) 

We immediately recognize in (33) the Lie algebra of the general linear group 
GL(4, R). This fact is closely related to the standard gauge-theoretic interpretation 
[25] of metric-affine gravity as the gauge theory of the general affine group GA(4, R), 
which is a semidirect product of spacetime translation group times GL(4, R). 
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The transformation properties (32) determine the form of the covariant and the 
Lie derivative of a matter field: 

:= - Tki^ V’®, (34) 

:= (35) 

The commutators of these differential operators read 

(V^V/ - = - Rkl/(a^B)di’^ - (36) 

(C^Vk - VkCO^^ = - (37) 

2.1.4 Symmetries in MAG: Generalized Killing Vectors 

As is well known, symmetries of a Riemannian spacetime are generated by Killing 
vector fields. Each such field defines a so-called motion of the spacetime manifold, 
that is a diffeomorphism which preserves the metric gtj. 

Suppose Q is a Killing vector field. By definition, it satisfies 

V/0+V,-0=0. (38) 

By differentiation, we derive from this the second covariant derivative 

%^jCk = Rjk‘Ci- (39) 

Apply another covariant derivative and antisymmetrize: 

V[„V,]V^CA: = '^[n(R\jk\i]0)- (40) 

After some algebra, the last equation is recast into 

C^nRijkl + Rnjkl^iC + %nkl^jC + Rijnl%C + Rykn^lC = 0- (41) 

The Eqs. (38), (39) and (41) have a geometrical meaning: 

Ccgij = 0, (42) 

£cr/ = 0, (43) 

C(;Rijki = 0. (44) 

That is, the Lie derivatives along the Killing vector field ( vanish for all Riemannian 
geometrical objects. Moreover, one can show that the same is true for all higher 
covariant derivatives of the Riemannian curvature tensor [31] 



. . . "^nN^ijkl) = 0 - 



(45) 
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Let us generalize the notion of a symmetry to the metric-affine spacetime. We take 
an ordinary Killing vector field ( and postulate the vanishing of the Lie derivative 

jCcNk/ = 0 (46) 

of the distortion tensor. Combining this with (11) and (43), we find an equivalent 
formulation 

Ccgij = 0, (47) 

= 0. (48) 

We call a vector field that satisfies (47) and (48) a generalized Killing vector of 
the metric-affine spacetime. By definition, such a C generates a diffeomorphism of 
the spacetime manifold that is simultaneously an isometry (47) and an isoparal- 
lelism (48). 

Since the Lie derivative along a Killing vector commutes with the covariant deriv- 
ative, see (26), we conclude from (14) and (44) that the generalized 

Killing vector leaves the non-Riemannian curvature tensor invariant 



C^Rki/ = 0 . 



(49) 



It is also straightforward to verify that 

=0 (50) 

for any number of covariant derivatives of the curvature. 

Later we will show that generalized Killing vectors have an important property: 
they induce conserved quantities on the metric-affine spacetime. 



2.2 Dynamics of Metric-Affine Gravity 

The explicit form of the dynamical equations of the gravitational field is irrelevant 
for the conservation laws that will form the basis for the derivation of the test body 
equations of motion. However, for completeness, we discuss here the field equations 
of a general metric-affine theory of gravity. The standard understanding of MAG 
is its interpretation as a gauge theory based on the general affine group GA(4, R), 
which is a semidirect product of the general linear group GL(4, R), and the group 
of local translations [25]. The corresponding gauge-theoretic formalism generalizes 
the approach of Sciama and Kibble [32, 33]; for more details about gauge gravity 
theories, see [20, 21]. In the standard formulation of MAG as a gauge theory [25], the 
gravitational gauge potentials are identified with the metric, coframe, and the linear 
connection. The corresponding gravitational field strengths are then the nonmetricity, 
the torsion, and the curvature, respectively. 
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Here we use an alternative formulation of MAG, in which gravity is described 
by a different set of fundamental field variables, i.e. the independent metric gij and 
connection , see [15-17, 34-40], for example. It is worthwhile to compare the 
field equations in the different formulations of MAG, and in particular, it is necessary 
to clarify the role and place of the canonical energy-momentum tensor as a source 
of the gravitational field. Since one does not have the coframe (tetrad) among the 
fundamental variables, the corresponding field equation is absent. Here we demon- 
strate that one can always rearrange the field equations of MAG in such a way that 
the canonical energy-momentum tensor is recovered as one of the sources of the 
gravitational field. 

Assuming standard minimal coupling, the total Lagrangian density of interacting 
gravitational and matter fields reads 

£ = 2J(5y, Rijk\ NkiJ) + £mA9ij, V’"', (51) 



In general, the gravitational Lagrangian density ^ is constructed as a diffeomorphism 
invariant function of the curvature, torsion, and nonmetricity. However, in view of the 
relations (12) and (13), we can limit ourselves to Lagrangian functions that depend 
arbitrarily on the curvature and the distortion tensors. The matter Lagrangian depends 
on the matter fields and their covariant derivatives (34). 

The field equations of metric-affine gravity can be written in several equivalent 
ways. The standard form is the set of the so-called “first” and “second” field equations. 
Using the covariant derivative for densities (27), the field equations are given by 



- ^k' = (52) 

j + - e"',- = 6'/. (53) 

Here the generalized gravitational field momenta densities are introduced by 



^kii. 



- 2 - 









523 



dRuij ^ dTktJ 

and the gravitational hypermomentum density 



^ki. _^ki. _ ^ki. ^ _ 



:= - 



523 



523 



dNkii ■ 



(54) 



(55) 



Furthermore, the generalized energy-momentum density of the gravitational field is 



<tk' = 4 ® + + Rkln"‘^“"m. 



(56) 
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The sources of the gravitational field are the canonical energy-momentum tensor and 
the canonical hypermomentum densities of matter, respectively: 





(57) 


k . 9£mat ^ i 

® ' ■= ar,,J = * ■ 


(58) 



It is straightforward to verify that instead of the first field equation (52), one can use 
the so-called zeroth field equation which reads 



2 



^9ij 



= ej. 



(59) 



On the right-hand side, the matter source is now represented by the metrical energy- 
momentum density which is defined by 






(60) 



The system (52) and (53) is completely equivalent to the system (53) and (59), and 
it is a matter of convenience which one to solve. 



2.3 From Densities to Tensors 

In actual metric-affine gravity models the Lagrangian densities are constructed in 
terms of the Lagrangian functions: £ = £mat = \/^^mat- 

Accordingly, we find for the gravitational field momenta 

(61) 

and j = j. The gravitational sources (57), (58) and (60) are rewritten 

in terms of the canonical energy-momentum tensor, the canonical hypermomentum 
tensor, and the metrical energy-momentum tensor, respectively: 

6'/ = V^A'/, tij = ^tij. (62) 

The usual spin arises as the antisymmetric part of the hypermomentum, 

't/ := (63) 

whereas the trace is the dilation current. The symmetric traceless part 

describes the proper hypermomentum [25]. 
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As a result, the metric-affine field Eqs. (52) and (53) are recast into 




(64) 



(65) 



2.3.1 Example: Matter Model with Microstructure 

In order to give an explicit example of physical matter with micro structure, we recall 
the hyperfiuid model [41]. This is a direct generalization of the general relativistic 
ideal fluid variational theory [42, 43] and of the spinning fluid model of Weyssenhoff 
and Raabe [44, 45] . Using the variational principle for the hyperfiuid [41], one derives 
the canonical energy-momentum and hypermomentum tensors: 



where is the 4- velocity of the fluid and p is the pressure. Fluid elements are 
characterized by their micro structural properties: the momentum density Pk and the 
intrinsic hypermomentum density Jm^. 



3 General Lagrange-Noether Analysis 

As a first step, we notice that the gravitational (geometrical) and material variables 
can be described together by means of a multiplet, which we denote by = 
(gij, We do not specify the range of the multi-index J at this stage. The 

matter fields may include, besides the true material variables, also auxiliary fields 
such as Lagrange multipliers. With the help of the latter we can impose various 
constraints on the geometry of the spacetime. Furthermore, we can use the Lagrange 
multipliers to describe models in which the Lagrangian depends on arbitrary-order 
covariant derivatives of the curvature, torsion, and nonmetricity. Then the general 
action reads 




( 66 ) 




(67) 




( 68 ) 



where the Lagrangian density £ = £(0‘^,9/0'^) depends arbitrarily on the set of 
fields and their first derivatives. 
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Our aim is to derive Noether identities that correspond to general coordinate 
transformations. However, it is more convenient to start with arbitrary infinitesimal 
transformations of the spacetime coordinates and the matter fields. They are given 
as follows: 



^ (v) = + 5x ^ , 

^ <!>'■' (x') = <I)-'(x) +<5<I)-'(x). 



(69) 

(70) 



Within the present context it is not important whether this is a symmetry transfor- 
mation under the action of any specific group. The total variation (70) is a result of 
the change of the form of the functions and of the change induced by the transfor- 
mation of the spacetime coordinates (69). To distinguish the two pieces in the field 
transformation, it is convenient to introduce the substantial variation: 

:= d)'-'(x) - <t-'(x) = 5^^ - 5x^dk^^ . (71) 



By definition, the substantial variation commutes with the partial derivative, 5(s)di = 
diS(s). 

We need the total variation of the action: 



SI 



j d‘^xSZ-\-5{d‘^x)2)^. 



(72) 



A standard derivation shows that under the action of the transformation (69) and (70) 
the total variation reads 



/' 



51 = / 



■ 5Z 






+ di I £,Sx‘ + 









5(s)^ 



')]■ 



Here the variational derivative is defined, as usual, by 



( 5 £ 

■ 




(73) 



(74) 



3.1 General Coordinate Invariance 



Now we specialize to general coordinate transformations. For infinitesimal changes 
of the spacetime coordinates and (matter and gravity) fields (69) and (70) we have 
x' ^ x' + 5x' , Qij Qij + Sgij, Tki^ Fh-' + STui , and ^ + (5?/)^, where 

variations are given by (2)-(4), and (32). Substituting these variations into (73), and 
making use of the substantial derivative definition (71), we find 



51 = - 







(75) 
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where explicitly 



“ 5gij ^ ^ dkdiVi/‘ 



ddiTin’” 



+ 5, 



J dZ ^ ^ , 0£ ^ , ^ 

^ I dkQmn + / 4 dk"^ 

\ddi~ 






i Qmn 

5Z 



ddi^^ 



— 2-1 — Qkj + YJJ (o-^B)k + 



-(51£) 



Sg, 



dZ 

^^igmn 

dZ 



5ip^ 

^kgmn ^ i I ^ 



dZ 

ddjQi, 



dZ 

ddiip^ 

~gnk 



dkip"^ - S[z 



dZ 

ddjxl)^ 



Qk‘^ = 



= 



druJ 
dZ 

ddnVu'" 

dZ 



j dZ j dZ : 

Zlk^ + ,. Tkl^ - — — r Tij + 



dVa^ 



dTuk ‘J 



i(T^B)k 

dZ 

ddiTin 



V') 



■ dkVin 



„ dZ f> r 

(JnZkt + T-i m ^nZlk t-> t ^nZln 



ddnTu^ 






dZ 



+ 



dZ 



ddnVln 

■p m 

rJ kl 



ddatp^'^" ’ dVdj)^ ddaTj^r 

. dZ dZ ^ ^ dZ ^ 

” 1 ” 2 Qkn ^ ^Ik ^ ^Ih 



^^{i9j)n 

dZ 



ddaT 



'O’ A \i\j)" 



dd^iT\in\^ 



dd{pTij) 






(76) 



(77) 

(78) 

(79) 



If the action is invariant under general coordinate transformations, 51 = 0, in view of 
the arbitrariness of the function and its derivatives, we find the set of four Noether 
identities: 

= 0, = 0, = 0, = 0. (80) 



General coordinate invariance is a natural consequence of the fact that the action 
(68) and the Lagrangian £ are constructed only from covariant objects. Namely, 
£ = £(t/;^, Qij, Rkii^ , Nk/) is a function of the metric, the curvature (5), the 

torsion (6), the matter fields, and their covariant derivatives (34). Denoting 



:= 



dZ 

dRijk‘ ’ 



gh 



dZ 

dNi/ ’ 



we find for the derivatives of the Lagrangian 



dZ 

dZ 






(81) 



( 82 ) 
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dZ 

ddkOij 






(84) 



As a result, we straightforwardly verify that = 0 and = 0 are indeed 
satisfied identically. 

Using (82) and (83), we then recast the two remaining Noether identities (76) and 
(77) into 



^ ^ - 4^) 

+ V,- + p‘^"mdkRiln"' 

+ B)m^ ^Ikn^ + ^J^'^rndk^ln^ 

+ Iv,- dkQmn, (85) 

+ w 

+ v,- 

I o ^ I Jni J? ^ Jnm D i 

\ m^kln P m^lnk P k^lnm 

+ V„ - Ai®") 9jk = 0. (86) 

Notice that the variational derivative (74) with respect to the matter fields can be 
identically rewritten as 



5£ 

and turns out to be a covariant tensor density. It is also worthwhile to note, that the 
variational derivative w.r.t. the metric is explicitly a covariant density. This follows 
from the fact that the Lagrangian depends on pij not only directly, but also through 
the objects Qkij and Nki-^ . Taking this into account, we find 

62 _ d2 g ( 

^Qij ^9ij xddnQij 

= - lv„ - Pvln\ (gg) 

dgij 2 \ } 

The Noether identity (86) is a covariant relation. In contrast, (85) is apparently non- 
covariant. However, this can be easily repaired by replacing Qk = ^ with an equiv- 
alent covariant Noether identity: Q'k •= ^k ~ = 0- Explicitly, we find 
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+ Qkij 



:n 



m 



= 0 . 



(89) 



On-shell, i.e. assuming that the matter fields satisfy the field equations 5Z/5\jj^ = 
0, the Noether identities (86) and (89) reduce to the conservation laws for the energy- 
momentum and hypermomentum, respectively. 

Equation (86) contains a relation between the canonical and the metrical energy- 
momentum tensor, and the conservation law of the hypermomentum. In the next 
section we turn to the discussion of models with general nonminimal coupling. 



4 Conservation Laws in Models with Nonminimal 
Coupling 

The results obtained in the previous section are applicable to any theory in which 
the Lagrangian depends arbitrarily on the matter fields and the gravitational field 
strengths. Now we specialize to the class of models described by an interaction 
Lagrangian of the form 



Here is the ordinary matter Lagrangian. We call F = F{gij, 

^kliK the coupling function and assume that it can depend arbitrarily on its 
arguments, i.e. on all covariant gravitational field variables of MAG. When F = I 
we recover the minimal coupling case. 



4.1 Identities for the Nonminimal Coupling Function 

As a preliminary step, let us derive identities which are satisfied for the nonminimal 
coupling function F = F{gij, Rkuf Nu^). Lor this, we apply the above Lagrange- 
Noether machinery to the auxiliary Lagrangian density £q = \/^ F. This quantity 
does not depend on the matter fields, and both (86) and (89) are considerably sim- 
plified. In particular, we have 



L = Figij, Rkiik Nkh 



(90) 
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^9ij 






F‘J ■- 



5F 

^9ij' 



(91) 



Then we immediately see that (86) and (89) reduce to 






-fiJ _ _ 
2 



lv„ _|_ F^nj)i _ j 



Qkij 



+ P^^\ykRiln'” +F‘\ykNln 



2Fk' ^ Ekin'" - P‘"^mRlnk'" + P‘"’"kRl, 

- P‘"kNln‘ + P“nNkl" + P“nNlk" 



Ini 



rJnm 



+ v, 



^(ni)j 



+ I 9jk. 






(92) 



(93) 



Here we denoted 



^ijk . 
p-^ I := 



dF 

dRijk‘ ’ 



0 .. 

P‘h 



dF 

dNi/ ■ 



(94) 



Notice that for any tensor density we have relations (30) and (29). 

The identity (92) is naturally interpreted as a generally covariant generalization of 
the chain rule for the total derivative of a function of several variables. This becomes 
obvious when we notice that (88) implies 



-F‘j - ^ V„ j ^ S7kgij. (95) 

It should be stressed that (92) and (93) are true identities, they are satisfied for any 
function F(gij, RkuK Nu^) irrespectively of the field equations that can be derived 
from the corresponding action. 



4.2 Conservation Laws 

Now we are in a position to derive the conservation laws for the general nonminimal 
coupling model (90). Recall the definitions of the canonical energy-momentum tensor 
(57), the canonical hypermomentum tensor (58) and the metrical energy-momentum 
tensor (60). 

In view of the product structure of the Lagrangian (90), the derivatives are easily 
evaluated, and the conservation laws (86) and (89) reduce to 
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- Ftk‘ - V„ (fa'/) + + ^2Fk‘ 

+ 2p‘^\Rkin'^ - h"'kRlnJ 

+ - p“„Nki" - P^\Nik’^ 

- V„ - F®") gjk]L^.t = 0, (96) 

Vi (fS^') + f[-E,'f/ + A'^jRkim" + \t‘jQkii\ 

+ {p‘‘\ykRiln"‘ + + ["F^ 

-1 V„(f("‘)^' + - F®”)] G^yjLmat = 0. (97) 

After we take into account the identities (92) and (93), the conservation laws (96) 
and (97) are brought to the final form: 

FS^' = Ftk‘ + V„ (fa'/) , (98) 

Vi (fE^') = F (^E/'F/ - A'^jRkim’' - - FmatV^F. (99) 

Lowering the index in (98) and antisymmetrizing, we derive the conservation law 
for the spin 

FE[y-] + V„ (Ft/) + Qni[i a‘ jf = 0. (100) 

This is a generalization of the usual conservation law of the total angular momentum 
for the case of nonminimal coupling. 



4.3 Rewriting the Conservation Laws 

Using the definition (31) and decomposing the connection into the Riemannian and 
non-Riemannian parts (10), we can recast the conservation law (98) into an equivalent 
form 



- tk^ + NnjA^k^ - Nnk^A^m^). ( 101 ) 

In a similar way we can rewrite the conservation law (99). At first, with the help of 
(12) and (13) we notice that 



(102) 
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Then substituting here (101) and making use of (10), (31), and the curvature decom- 
position (14), after some algebra we recast (99) into 



V,-{F(E^^‘ + = - FA^n\Rkim^ ~ ^kNinT) ~ L^^.VkF. (103) 



For the minimal coupling case, such a conservation law was derived in [46, 47]. 
The importance of this form of the energy-momentum conservation law lies in the 
clear separation of the Riemannian and non-Riemannian geometrical variables. As 
we see, the post-Riemannian geometry enters (103) only in the form of the distortion 
tensor which is coupled only to the hypermomentum current • This means 
that, in the minimal coupling case, ordinary matter — i.e. without micro structure, 
A^n = 0 — does not couple to the non-Riemannian geometry. In contrast, in the 
nonminimal coupling case, the derivative of the coupling coupling function F on the 
right-hand side of (103) may lead to a coupling between non-Riemannian structures 
and ordinary matter. 



4.4 Conserved Current Induced by a Spacetime Symmetry 

Every generalized Killing vector field generates a conserved current. This can 
be demonstrated from the analysis of the system (101) and (103) as follows. Let us 
contract equation (101) with and Eq. (103) with then subtract the resulting 
expressions. Note that the contraction tk^ V/ = 0 vanishes because the first factor is 

a symmetric tensor and the second one is skew-symmetric. Then after some algebra 
we derive 



Note that the transposed connection appears here. The right-hand side of (104) 
depends linearly on the Lie derivatives along the Killing vector: C(^F = C^V^F and 



- (VkC)NiJ. (106) 



When is a generalized Killing vector, we have Cc^Nim^ = 0 in view of (46). 
Furthermore, recalling that F = F{gij, we find 




(104) 



Here we associate a current with a Killing vector field via 




(105) 




by making use of (46), (47), and (49). 
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As a result, the right-hand side of (104) vanishes for the generalized Killing vector 
field, and we conclude that the induced current (105) is conserved 



This generalizes the earlier results reported in [30, 48, 49]. In Sect. 6.1.2 we will 
show that there is a conserved quantity constructed from the multipole moments 
which is a direct counterpart of the induced current (105). It is worthwhile to give 
an equivalent form of the latter: 



4.5 Riemannian Limit 

Our results contain the Riemannian theory as a special case. Suppose that the torsion 
and the nonmetricity are absent Ti^ = 0, Qkij = 0, hence Nf/ = 0. Then for usual 
matter without microstructure (i.e. matter with = 0) the canonical and the 

metrical energy-momentum tensors coincide, = tk^. Asa. result, the conservation 
law (99) reduces to 



This conservation law for the general nonminimal coupling model was derived earlier 
in [26] without using the Noether theorem, directly from the field equations.^ The old 
result established the conservation law for the case in which F = F(Rijk^) depends 
arbitrarily on the components of the curvature tensor, correcting some erroneous 
derivations in the literature, see [26] for details. 

Quite remarkably, (110) generalizes the earlier result to the case in which the 
nonminimal coupling function F is a general scalar function of the curvature tensor. 



5 General Multipolar Framework 

In this section we derive “master equations of motion” for a general extended test 
body, which is characterized by a set of currents 




(108) 







(109) 




( 110 ) 






(Ill) 



^Notice a different conventional sign, as compared to our previous work [26]. 
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Normally, these are so-called Noether currents that correspond to an invariance of 
the action under a certain symmetry group. However, this is not necessary, and any 
set of currents is formally allowed. We call dynamical currents. The generalized 
index (capital Latin letters A, B, .. .) labels different components of the currents. 

As the starting point for the derivation of the equations of motion for generalized 
multipole moments, we consider the following conservation law: 

Vj = - Kjb'^ ( 112 ) 

On the right-hand side, we introduce objects that can be called material currents 

K'^ (113) 

to distinguish them from the dynamical currents /V 7he number of components 
of the dynamical and material currents is different, hence we use a different index 

with dot A, 5, . . ., the range of which does not coincide with that of A, B, At 

this stage we do not specify the ranges of both types of indices, this will be done for 
the particular examples which we analyze later. As usual, Einstein’s summation rule 
over repeated indices is assumed for the generalized indices as well as for coordinate 
indices. 

Both sets of currents and are constructed from the variables that describe 
the structure and the properties of matter inside the body. In contrast, the objects 

Ajb^, (114) 



do not depend on the matter, but they are functions of the external classical fields 
which act on the body and thereby determine its motion. The list of such external 
fields include the electromagnetic, gravitational and scalar fields. 

We will now derive the equations of motion of a test body by utilizing the covariant 
expansion method of Synge [7]. For this we need the following auxiliary formula 
for the absolute derivative of the integral of an arbitrary bitensor density = 

(v, y) (the latter is a tensorial function of two spacetime points): 



I / 

E(^) 



= J J (115) 

S(5) S(i) 



Here := /ds, s is the proper time, £ = V/, and the integral is performed 

over a spatial hypersurface. Note that in our notation the point to which the index of a 
bitensor belongs can be directly read from the index itself; e.g., denotes indices at 
the point y . Furthermore, we will now associate the point y with the world-line of the 
test body under consideration. Here a denotes Synge’s [7] world- function and its 

first covariant derivative; g^x is the parallel propagator for vectors. For objects with 
more complicated tensorial properties the parallel propagator is straightforwardly 
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generalized to x and We will need these generalized propagators to deal 
with the dynamical and material currents and More details are collected in 
the appendix on our conventions. 

After these preliminaries, we introduce integrated moments for the two types of 
currents via (for ^ = 0, 1, . . .) 

S(t) 

iyi...ynYoy' ^ (-i)^ ( 117 ) 

S(t) 

S(t) 



Here the densities are constructed from the currents: and = 

. Integrating (112) and making use of (1 15), we find the following “master 
equation of motion” for the generalized multipole moments: 



£_ ;yi-ynYo 

ds-' 



— n yGi jy2-yn)Yo ^ iiyi-yn-i\Yo\yn) 



- Ayry<>iy^-y’‘^"y' - 

— nh) • my^"-ynY' _ rrfi) • ^yi---yn+lY' 

11 Y'^ A1 Y';yn+i^ 

r_n^^7n/Gi , ;y2-yn)yn+i-yn+kYoy 







+ (-1)^0^,, 



y'yn+i--yn+k 



- A lyi-yn+kY^'y' 



"^y'Y" ;yn+i...yn+k^ 



-n 



ho . 



Y';yn+i...yn+k 



yi...yn+kY‘ 



Y" 

1 



yn+kY'y" 



+ 7 



yi...yn+kY' yX 



(119) 



5.1 Electrodynamics in Minkowski Spacetime 



To see how the general formalism works, let us consider the motion of electrically 
charged extended test bodies under the influence of the electromagnetic field in fiat 
Minkowski spacetime. This problem was earlier analyzed by means of a different 
approach in [50]. 
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In this case, it is convenient to recast the set of dynamical currents into the form 
of a column 

where is the electric current and is the energy-momentum tensor. Physically, 
the structure of the dynamical current is crystal clear: the matter elements of an 
extended body are characterized by the two types of “charges”, the electrical charge 
(the upper component) and the mass (the lower component). 

The generalized conservation law comprises two components of different tensor 
dimension: 

’^(5.)= (-Aid 



where the lower component of the right-hand side describes the usual Lorentz force. 

Accordingly, we indeed recover for the dynamical current (120) the conservation 
law in the form (112), where = 0 and 




( 122 ) 



The generalized moments (116)-(118) have the same column structure, reflecting 
the two physical charges of matter: 



jyvynYQ 

lyvynYoy' 



( 

( 



jyv-yn \ 
pyvynyQ j ’ 


(123) 


pi-yny' \ 

j^yv-ynyoy' J ’ 


(124) 



whereas = 0. 

As a result, the master equation (119) reduces to the coupled system of the two 
sets of equations for the moments: 



D 

ds'^ 



-yv-yn 



^jyyv-ynyo ^ 

ds^ 






= yiyi pyi-yn)yo ^^(yi-yn-ilyobn) 



j^yyofyi-yny 



00 

Z — F 
k\ ^ ’ 



\yn+\---yn+k 



(125) 



fyi-yn+ky _ (i 26 ) 



k=i 



These equations should be compared to those of [50]. 
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6 Equations of Motion in Metric- Affine Gravity 



We are now in a position to derive the equations of motion for extended test bodies 
in metric-affine gravity. As a preliminary step, we rewrite the conservation laws (98) 
and (99) as [22]: 



- tk\ 



(127) 



= - Vj\^J - Rkjm"A'"„j - -QkftJ - Ak Lmat. (128) 



Here we denoted Ak := log F, and 

UjJ^ := Aj6‘jl - Njj5l + 
VjJ^ := Aj5i + N’^jn. 

Introducing the dynamical current 



and the material current 






/ \ 

V f^mat / 



(129) 

(130) 



(131) 



(132) 



we then recast the system (127) and (128) into the generic conservation law (112), 
where we now have 






11 ^ — 





-s)K. 


K^jn- 


yjk'^ , 








A' j ■ 



) 



(133) 



(134) 



Like in the previous example of an electrically charged body, the matter ele- 
ments in metric-affine gravity are also characterized by two “charges”: the canonical 
hypermomentum (upper component) and the canonical energy-momentum (lower 
component). This is reflected in the column structure of the dynamical current (131). 
The material current (132) takes into account the metrical energy-momentum and the 
matter Lagrangian related to the nonminimal coupling. The multi-index A = [ik, k}, 
whereas A = [ik, 1}. Accordingly, the generalized propagator reads 



X 





0 ' 


0 





(135) 
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and we easily construct the expansion coefficients of its derivatives from the corre- 
sponding expansions of the derivatives of vector propagator x • 



The last step is to write the generalized moments (116)-(118) in terms of their 
components: 



For the two most important moments, “/z” stands for the hypermomentum, whereas 
“p” stands for the momentum.^ Finally, substituting all the above into the “master 
equation” (1 19), we obtain the system of multipolar equations of motion for extended 
test bodies in metric-affine gravity: 



^Note that in order to facilitate the comparison with our previous work [51], we provide in appen- 
dix the explicit form of integrated conservation laws (127) and (128), as well as the generalized 
integrated moments (141)-(143) in the notation used in [51]. 




(136) 



where we denoted 




In particular, for the first expansion coefficient (^ = 1), we find 




(138) 



(139) 



For completeness, let us write down also another generalized propagator 




(140) 




(141) 




(142) 




(143) 
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j^yi-ynyayb 

ds 



— n + n q^y^-yn-i\yayb\yn) 

^j^yi-ynybya _ jj^yi-ynyayb 



2^^''y'y"yn+i 



^qyi-yn+iy'yby" y/' j^yi-yn+iy'yb^ 

^qyi-yn+iyay'y" y/' ij^yi-yn+iya/^ 



jj , „yayb^y\-yny y yo _tj , „yayb ^yi-yn+iy y yo 
^yoy'y" ^ ^yoy'y" ‘,yn+i^ 



k=2 L 

+ (-i)S"“y/'y„+i...y„+, {qy^-y''y^y'y'’y" + vy"hy^-y-^>^y'y») 
+ (qy^-y-^>^y‘‘y'y" + 

_ {—\)^n a^^^yy 3 ; ^qy2-yn)yn+i-yn+kyayby' 



JJ , ,,yayb ^yi-yn+kyy 

^yoy'y" ‘,yn+i-yn+k^ 



yo 



( 144 ) 



:^pyi-ynya 

ds^ 



— n v^y^ py2-yn)ya ^ fjj^iyi-yn-i\ya\yn) 

_ j^ya^yi-yn _ ^^^yi-yn+i 

- Vy.yy-ky^-y-y'y^' - 



]?ya , „ 

2 yyyn+\ 



-Rya 



(ky^- 



yn+\y y 



y j.yi-yn+iy 



+ v^p 



') 



yoyy 



,^yi-yny'y"yo _ j?ya , „ ^yi-yn+i//Vo 
q IS. yoyy';yn+i^ 



-oy^ n niy^-y^y’y” — // / 

y'yp y"y'\yn+\k^ 



00 ^ r 

k=2 ^ 

+{~yy%' 



y y yn+\-yn+k 



{ky^- 



■yn+ky y 



+ p 



y ny^-yn+ky 



') 



( l)^n + 

“ yoy'y"',yn+i-yn+k^^^"' 



— V // ,y^ ky^-y^+ky y 

^y y ‘,yn+i-yn+k'^ 



j^y2-yn)yn+i-yn+kyay' 



--nya 



Q^"'y"y;yn+i...yn+kd'' 



yi... yn+ky y 



_j^ya 



dyi--yn+k 

‘,yn+l---yn+k‘^ 



( 145 ) 
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6.1 Special Cases 

The general equations of motion (144) and (145) are valid to any multipolar order. 
In the following sections we focus on some special cases, in particular we work out 
the two lowest multipolar orders of approximation, and consider the explicit form of 
the equations of motion in special geometries. 



6.1.1 General Pole-Dipole Equations of Motion 

From (144) and (145), we can derive the general pole-dipole equations of motion. The 

relevant moments to be kept at this order of approximation are: p^, 

kab ^ k^bc^ jj^ab^ jj^abc ^ ^ ^ Since all objects are now evaluated on the world-line. 



we switch back to the usual tensor notation. 

For n = I and n = 0, Eq. (144) yields 

0 = (146) 

^hab ^ J^ba _ ^ab _ U^^ab^dec^ ^47^ 

ds 

Furthermore for ^ = 2, 1, 0 Eq. (145) yields 

0 = (148) 

D ^ab ^ j^ba _^apb _ j^b _ y^bj^acd _ i gb^^^acd^ ^ J49) 

ds 2 

^ T/ ajhc na bed ^ ,,bc 

-rP = -Veb k - R dbcq - cbP 
ds 2 

- Vdd,hk^^^ - (150) 



Rewriting the equations of motion Let us decompose (146) and (147) into sym- 
metric and skew- symmetric parts: 

^abc ^ i^a(bc) 4. ^ibc)a _ ^af^ibc)^ 

0 = +q^bc]a 

l_l“b = - T/jW + k(ab) _ y^^m^dec^ 
ds 



(151) 

(152) 

(153) 



( 154 ) 
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As a result, we can express the moments symmetric in the two last indices 

and (in general, this is possible also for an arbitrary order — 

^c\...cn{ab)^^ in terms of the other moments. 

Let us denote the skew-symmetric part := as this simplifies greatly the 
subsequent manipulations and the comparison with [51]. 

The system of the two equations (148) and (152) can be resolved in terms of the 
3rd rank /^-moment. The result reads explicitly 



j^abc ^ ^a^cb ^p[ab] _ ^ab^ ^b ^^iac] _ ^^[bc] _ 



+ q[‘^b]c _^^[ac]b _^^[bc]a_ 



(155) 



This yields some useful relations: 

l^a[bc] ^ _ya^bc _^_^[bc]a^ 

l^[ab]c ^ ^[ap\c\b] ^ (p[ab] _ ^ ^[ab]c 



(156) 

(157) 



The next step is to use the Eqs. (151), (153) together with (155) and to substitute 
the /i-moments and ^-moments into (147), (149) and (150). This yields a system that 
depends only on the p,h,q, and ^ moments. 

Let us start with the analysis of (150). The latter contains the combination + 

y[dpb]c ihQ skew symmetry is imposed by the contraction with the Riemann 

curvature tensor, which is antisymmetric in the two last indices. Making use of (155), 
we derive 



j^[a\c\b] ^[bpa]c ^ ^abc ^acb _ ^bca ^ 

where we introduced the abbreviation 

:= A 

Note that by construction . 

Then by making use of the Ricci identity we find 

^ ( hbcd I ^,d hc\ na I ^[cd]b . 

- -R cdb\k +v p j = ^ bed V yp'^ ^ - S jj . 



(158) 



(159) 



(160) 



Substituting from (149) and pf^^ from (153), we find after some algebra 

- Veb^k^^ - = - Ab^ - N"cd^ - [p^ + N\dh^^) v^Ab 

- k>’‘^^AhAc + {N\hNdc" - N\nNd\) ■ (161) 
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Further simplification is achieved by noticing that 

DA DA 

v>’Ah = k>’^^AhAc = — , (162) 

as as 

where we used (148) and recalled that = A;^. 

Analogously, taking from (156) and from (151), we derive 



T7 <2 j^bcd /^a ..bed a j^cab \ -kjo. ^edb \jci ^,bj^c 

-Vdc -M - -Q dc-bP = - Ah-^ck + N cd-,b<l -N cd-bV h 



(163) 



We can again use Ab = A-b and (148) to simplify 

- (164) 

as 

After these preliminary calculations, we substitute ( 1 60)-( 1 64) into ( 1 50) to recast 
the latter into 

£ {Pp‘^ + FN\dh^‘^ + p^^'Vbp) = FR\cdV^ 

_FqCbd + N‘^,^N/b] 

- M" - FSj’A^.^b- (165) 

Finally, combining (147) and (149) to eliminate we derive the equation 

£ _ i^ab-^ = ^ab _ ^^b ^ 

+ q^‘^N^cd - q^’’‘‘Ndc^ + q‘^^‘‘N/c 
- A^ + - k‘^‘^‘=)Ac. (166) 

Following [51], we introduce the total orbital and the total spin angular moments 

j_^ab ._ 2p^ab]^ ^ab ._ _2h^ab] ^ 

and define the generalized total energy-momentum 4-vector and the generalized total 
angular momentum by 



-pa ._ p^pa p p pbap^p^ ( 168 ) 

:= F{L‘^’’ + S“^). (169) 



Then, taking into account the identity (14), which with the help of the raising and 
lowering of indices can be recast into 
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V^Ndcb = -R%cb + R%cb + N\b,d + N\bNdc^ - Nd\N\n. (170) 

we rewrite the pole-dipole equations of motion (165) and (166) in the final form 

DV^ 1 ^ ^ 

= ^R\cdv‘’r‘‘ + Fq<^>’dV^Ndcb 
as 2 

(171) 

D 7^^ 

+ 2F{q‘=‘^^‘^N^^cd + N N d’’^ c) 

ds 

_2^[aV^]f-. (172) 

The last equation arises as the skew- symmetric part of (166), whereas the symmetric 
part of the latter is a non-dynamical relation that determines the moment 

= 7 ^ + Lyia (^b) ^ jb)c^^^ ^ ^(a^b) 

- q‘^^^‘'N>’\d + 

+ - q^‘^’’^‘=)Ac. (173) 

Here the symmetric moment of the total hypermomentum is introduced via 

■^ab p(ab) _ f^(ab) _ ^ 74 ^ 



6.1.2 Conserved Quantity 

The equations of motion for the multipole moments are derived from the conser- 
vation laws of the energy-momentum and the hypermomentum currents and 
. In Sect. 4.4 we demonstrated that every generalized Killing vector induces 
a conserved current constructed from and A^^ . Quite remarkably, there is a 
direct counterpart of such an induced current built from the multipole moments. 

Let be a generdized Killing vector, and let us contract equation (171) with (a 
and Eq. (172) with ^ and then take the sum. This yields 

2 ^V‘^Ca + 1 = Fq^^‘^C^Ndcb -^C^F- e%^(F. (175) 

On the right-hand side, the Lie derivatives of the distortion tensor and of the coupling 
function both vanish in view of (46) and (107). 

Consequently, we conclude that for every generalized Killing vector field the 
quantity 
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+ \j“‘’^aCb = const. (176) 

is conserved along the trajectory of an extended body. 

We thus observe a complete consistency between (104), (105), (109) and (176), 
(175). 



6.1.3 Coupling to the Post-Riemannian Geometry: Fine Structure 

Let us look more carefully at how the post-Riemannian pieces of the gravitational 
field couple to extended test bodies. At first, we notice that the generalized energy- 
momentum vector (168) contains the term that describes the direct inter- 

action of the distortion (torsion plus nonmetricity) with the intrinsic dipole moment 
of the hypermomentum. Decomposing the latter into the skew- symmetric (spin) part 
and the symmetric (proper hypermomentum -f dilation) part, we find 

N\dh^‘^ = - (177) 

Here we made use of (13). This is well consistent with the gauge-theoretic structure 
of metric-affine gravity. The second term shows that the intrinsic proper hypermo- 
mentum and the dilation moment couple to the nonmetricity, whereas the first term 
displays the typical spin-torsion coupling. 

Similar observations can be made for the coupling of higher moments which 
appear on the right-hand sides of (171) and (172) — and thus determine the force and 
torque acting on an extended body due to the post-Riemannian gravitational field. In 
order to see this, let us introduce the decomposition 

-]^q^bc ^^abc j^^qcab 

into the two pieces 

1 ^q[ac]b ^ ^[bc]a _ ^(ab)c'^ ^ 

q<^bc 1 ^q[ab]c ^[ac]b _ ^[bc]a^ ^^gQ^ 

The overscript “J” and “5'” notation shows the relevance of these objects to the 
dilation plus proper hypermomentum and to the spin, respectively. By construction, 
we have the following algebraic properties 

q[ab]c ^ Q q{ab)c ^ q 



(181) 
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Making use of the decomposition (178) and of the explicit structure of the distor- 
tion (11), we then recast the equations of motion (171) and (172) into 



DV^ 

ds 



ds 



\R\cdv^r‘^ 

+ Qdcb 

- 2v^‘^V’’^ 

+ Q^\d + 2c/’) 

_2^[aV^]F. 



(182) 



(183) 



Now we clearly see the fine structure of the coupling of extended bodies to the post- 
Riemannian geometry. The first lines in the equations of motion describe the usual 
Mathis son-Papapetrou force and torque. They depend on the Riemannian geome- 
try only. A body with the nontrivial moment (180) is affected by the torsion field, 
whereas the nontrivial moment (179) feels the nonmetricity. This explains the dif- 
ferent physical meaning of the higher moments (179) and (180). In addition, the last 
lines in (182) and (183) describe contributions due to the nonminimal coupling. 



6.1.4 General Monopolar Equations of Motion 

At the monopolar order we have nontrivial moments and The non- 

trivial equations of motion then arise from (144) for ^ = 0 and from (145) for 
n = l,n = 0\ 



0 = k’’‘^- 
0 = k‘’“- v“p'’, 

2^ = - v,b‘^k^‘^ - 
ds 2 



The first two Eqs. (184) and (185) yield 

j^[ab] ^ ^[apb] ^ 

and substituting (184), (185) and (187) into (186) we find 

2W=_ev«F. 



(184) 

(185) 

(186) 



(187) 



ds 



(188) 
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From (187) we have = Mv^ with the mass M := pa, and this allows to recast 
(188) into the final form 



= (189) 

as F 

Hence, in general the motion of nonminimally coupled monopole test bodies is non- 
geodetic. Furthermore, the general monopole equation of motion (189) reveals an 
interesting feature of theories with nonminimal coupling. There is an “indirect” cou- 
pling, i.e. through the coupling function F(gij, RijkK Tf/, Qi/)^ of post-Riemannian 
spacetime features to structureless test bodies. 



6.1.5 Weyl-Cartan Spacetime 

In Weyl-Cartan spacetime the nonmetricity reads Qkij = Qkdij, where 2)^ is the 
Weyl covector. Hence the distortion is given by 

Nki = Kkf + 1 [&gkj - Qk5) - QjSi) . (190) 

The contortion tensor is constructed from the torsion, 

Kk/ = - \(Tk/ + T^kj + T^jk)- (191) 

As a result, the generalized momentum (168) in Weyl-Cartan spacetime takes the 
form: 



= Fp‘‘-^ - QbS‘’“ + 2'^z)) + p‘’“VbF. (192) 

Here we introduced the intrinsic dilation moment D := Qabh^^ • 

Substituting the distortion (190) into (171) and (172), we find the pole-dipole 
equations of motion in Weyl-Cartan spacetime: 

= -R“bcdv‘’J‘^‘^ + Fq‘=’’‘^V“Tcbd 
ds 2 

+ Z^V‘^Qb-iV‘^F -i’’Vb^‘'F, (193) 

D 7^^ s s 

ds 



(194) 
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Here we introduced the trace of the modified moment (179) 

rja . ^ ihca ( ^bac ^bca ^abc\ 

^ := 9bc^ = -Qbc [q -q - q j • (195) 

It is coupled to the Weyl nonmetricity. 



6.1.6 Weyl Spacetime 

Weyl spacetime [52] is obtained as a special case of the results above for vanishing 
torsion. Hence the contortion is trivial 



Kabc = 0. (196) 

Taking this into account, the generalized momentum (192) and the equations of 
motion (193) and (194) are simplified even further. 

It is interesting to note that besides a direct coupling of the dilation moment to 
the Weyl nonmetricity on the right-hand sides of (193) and (194), there is also a 
nontrivial coupling of the spin to the nonmetricity in (192). 



6.1.7 Riemann-Cartan Spacetime 

Another special case is obtained when the Weyl vector vanishes Qa = 0. Equa- 
tions (192)-( 194) then reproduce in a covariant way the findings of Yasskin and 
Stoeger [53] when the coupling is minimal {F = 1). For nonminimal coupling we 
recover our earlier results in [51]. 



7 Equations of Motion in Scalar- Tensor Theories 

The geometrical arena of scalar-tensor theories is the Riemannian spacetime, hence 
Nk/ = 0 which means that both the torsion Ti^ = 0 and the nonmetricity Qkij = 0 
vanish. 

Scalar-tensor theories have a long history and they belong to the most straightfor- 
ward generalizations of Einstein’s general relativity theory. In the so-called Brans- 
Dicke theory [54-58] a scalar field is introduced as a variable “gravitational coupling 
constant” (which is thus more correctly called a “gravitational coupling function”). 
Similar formalisms were developed earlier by Jordan [59, 60], Thiry [61] and their 
collaborators using the 5 -dimensional Kaluza- Klein approach. An overview of the 
history and developments of scalar-tensor theories can be found in [62-65]. 
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Surprisingly little attention was paid to the equations of motion of extended test 
bodies in scalar-tensor theories. Some early discussions can be found in [56, 66, 67], 
and in [68] the dynamics of compact bodies was thoroughly studied in the framework 
of the post-Newtonian formalism. 



7.1 Scalar-Tensor Theory: Jordan Frame 

We consider the class of scalar-tensor theories along the lines of [68] where the 

/ j 

action I = f ux £ is constructed on the manifold with the spacetime metric 9ij. 
J J J 

The Lagrangian density £ = has the following form 

L = +9‘j^ABdi(p^djcp'^ - 2l?j + d^, ij). (197) 

This action is an extension of standard Brans-Dicke theory [69] to the case when we 
have a multiplet of scalar fields (capital Latin indices A, B,C = 1,...,A^ label 
the components of the multiplet). Here k, = ^ttGIc^ denotes Einstein’s gravitational 
constant and in general we have several functions of scalar fields, 

F = F{ip^), u = u{ip^), 1 ab=1ab{f^). (198) 

J 

The Lagrangian Lni(V^, 9ij) depends on the matter fields and the gravitational 
field. 

J 

The metric 9ij determines angles and intervals in the Jordan reference frame . The 
^ J 

Riemannian curvature scalar R{9) is constructed from the Jordan metric. With the 
help of the conformal transformation 

9ij 9ij = F^9ij (199) 

we obtain a different metric on the spacetime manifold. This is called an Einstein 
reference frame. 



7.2 Conservation Laws: Einstein Frame 



In the Einstein reference frame the Lagrangian density in the scalar-tensor theory 
reads £ = with 






-R + g^'^ABdi^p^dip 






-2 



9ij). (200) 
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Here the scalar curvature R{g) is constructed from the Einstein metric gij, and 

lAB = ^(7a5 + aE^), U = -^U. (201) 

The metrical energy-momentum tensor for the matter Lagrangian Lmat is constructed 
as usual via (60). From the Noether theorem we find that it satisfies the generalized 
conservation law 

Ytij=^{Atij-gijt)&F. ( 202 ) 

r 

Here t \= g^hg. The derivation is given in [23]. 

To begin with, we recast (202) into an equivalent form 

y-tV = -Ai{a‘j + t'j) (203) 

by introducing 

Ai := di log E^j := -/f/4. (204) 



7.3 General Equations of Motion: Arbitrary Multipole Order 



We now derive the equations of motion for extended test bodies in scalar-tensor 
gravity by making use of the master equations obtained for the general case of 
MAG. In scalar-tensor theory, the hypermomentum is zero = 0. Following the 
general scheme, we introduce the dynamical current which is a special case of the 
MAG current: 









(205) 



Taking into account the structure of the conservation laws (203) and (204), we define 
the material current 

/ \ 

(206) 






K^ = 



we then recast the system (203) into the generic conservation law, where we now have 




(207) 

(208) 



In accordance with (203) and (204) we now have 

Ai = di log F-\ 



Vj/ = Aj5l 



(209) 
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In view of (205) and (206) the generalized moments read 



jyi-ynY = ( ,) 

\py^-yny /’ 

•yi...y„Fyo _ / ^ \ 

\j^yvyny yo J ’ 

\ ^yi-yn J • 



( 210 ) 

( 211 ) 

( 212 ) 



It is worthwhile to notice that although we formally use a different notation for 
the upper and lower components of the moment (212), they are not independent. 
Recalling (206), we have the obvious relation 

(213) 



which we will take into account when rewriting the equations of motion. 

Now we can make use of the general MAG equations of motion (119). The 
first equation is a degenerate version of (144) since we have hy^-y^y^y^ = 0 and 
qyi--yn-iyaybyn _ ^nd we are left with the algebraic relation 



j^yi-ynytya — jj^yi-ynyayt ^ 



(214) 



for all n. Hence we find 



= - -gyy.ky^-y’^y y , 



(215) 



and the second equation (145) then gives rise to the equations of motion in scalar- 
tensor theory 



^^yi-ynya 

ds^ 



— n i;(3'1^3'2...yn)ya yiJ^{y\-yn-\\ya\yn) 
_y^ya^y\-yn _ js^ya^^ ^^^yi--yn+i 

— V // ,yahy\-yny'y'' ,ya j^yi...y„+iyy' 

yy'y /c vyty 



1 -V 

]?ya , „ 

2 ^ y y yn^x 



y- 



yn+iy y 



+ p 



yi-yn+ir 



') 



oo 

y- 



k=2 



_ (—^\kv^n;^yl , hyi--yn)yn+\--yn+kyay' 



.y a{y\ , rty^-yn)yn+\-yn+kya 

-TK n U P yy^+i...yn+kP 

^-(—^\k ya , ,, (j.yi-yn+kyy \ .y rjy^-yn+ky' 

1 y'y"yn+i...yn+k P 



') 



_ V // ,y^ hyi-yn+ky y _ ^ya 

^y y \yn+\--yn+k'^ ^ \yn+\-yn+k 



e- 



yn+k 



. (216) 
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The system (216) is valid up to any multipole order. In the following we specialize 
it to the dipole and the monopole case. 



7 A Pole-Dipole Equations of Motion 
in Scalar-Tensor Theory 

For ^ = 2, 1, 0 Eq. (216) yields 

0 = _ y(apb)c^ (217) 

D ^ab ^ i^ba _ ^apb _ ^b (218) 

as 

-Vcb‘'k>’^ - Vdf,bk'’^^ - A‘',b£!’ 
as 

- \R\db . (219) 

Taking into account that = 0, we resolve (217) in a standard way to find 

explicitly 

j^abc ^ ^a^cb ^Cp[ab] ^b ^[ac] ^ ^[bc] ^ ^220) 

In view of (215), we have in addition 



= - \gbck‘''^‘^. 


(221) 


C = - \gabk‘^’’- 


(222) 



Then repeating the same algebra as we did in MAG, we recast the system (218) and 
(219) into 



DV^ 1 _ _ _ _ 

= ■rR\cdV^J^‘^ - 
as 2 



DJ' 



ab 



ds 






(223) 

(224) 



Here, following [51], we have the total orbital angular moment := 
Whereas the generalized total energy-momentum 4-vector and the generalized total 
angular momentum are introduced by 

;= F-^p“ + p'’‘^VbF~^, 
j-ab . j^—Aj^ab 



(225) 

(226) 
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7.5 Monopolar Equations of Motion 
in Scalar-Tensor Theory 



At the monopolar order we find from Eq. (216) for n = 1 , n = 0: 

0 = k'’‘‘- v“p'’, 

Dn^ 



(227) 

(228) 



Making use of = 0, the first equation yields = 0, hence we have 



= Mv^ 



(229) 



with the mass M := v^Pa- Substituting (209), (227) and (229) into (228) we find 



1 = _ F~^ 

ds ^ 

Contracting this with Va , we derive 



ds 



^V^WaF 



-4 



we write (230) in the final form 



M 



Dv^ 

ds 






- v“v'’) 



^bF-^ 

/ 7-4 



Combining (222) with (227) we find 



4 ~ 4 ■ 

Substituting this into (232), we obtain 



Dv^ 

ds 



(9“‘’ 



V^V^) 



^bF 

F 



(230) 



(231) 



(232) 



(233) 



(234) 



Quite remarkably, we thus find that the dynamics of an extended test body in the 
monopole approximation is independent of the body’s mass. In case of a trivial 
coupling function F, Eq. (234) reproduces the well known general relativistic result. 

Interestingly, the mass of a body is not constant. Substituting (233) into (231) we 
can solve the resulting differential equation to find explicitly the dependence of mass 
on the scalar function: M = F^Mq with Mq = const. 
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8 Conclusions 



We have presented a general multipolar framework of covariant test body equations of 
motion for standard metric-affine gravity, as well as its extensions with nonminimal 
coupling between matter and gravity. Our results cover gauge theories of gravity 
(based on spacetime symmetry groups), and various so-called f(R) models (and 
their generalizations), as well as scalar- tensor gravity. 

Our results unify and extend a whole set of works [26, 51,53, 70-78] . In particular 
they can be viewed as a completion of the program initiated in [73], in which a 
noncovariant Papapetrou [2] type of approach was used. The general equations of 
motion (144) and (145) cover all of the previously reported cases. As demonstrated 
explicitly, the master equation (119) allows for a quick adoption to any physical 
theory, as soon as the conservation laws and (multi- )current structure is fixed. Table 1 
contains an overview of our main results in the context of metric-affine gravity. In 
particular, the explicit equations of motion at the pole-dipole and monopole order are 
given. We stress that these equations hold for the most general case, i.e. including a 
general nonminimal coupling between matter and gravity. The results for standard 
(minimal) MAG are easily recovered by choosing a trivial coupling function F. 

Our analysis reveals how the new geometrical structures in generalized theories 
of gravity couple to matter, which in turn should be used for the design of exper- 
imental tests of gravity beyond the Einsteinian (purely Riemannian) geometrical 
picture. In the case of minimal coupling, we have once more confirmed — now in 
a very general context — that only matter matter with microstructure (such as the 
intrinsic hypermomentum, including spin, dilation and shear charges) allows for 
the detection of post-Riemannian structures. However, in gravitational theories with 



Table 1 Overview: MAG equations of motion 
Lagrangian and conservation laws 

L = V(gij, RijkK Nkih + F(gij, RuV , Nth 
Vj(FA‘C) = F(Y.k' - tk‘ + NnJ^"'k" - N„k"‘^‘m’‘) 

+ A’^jNkm")] = - FA’”J{Rkim'' - ^kNim") - LrJVkF 

Equations of motion {any order) 

See Eqs.(144) and (145) 

Equations of motion {pole-dipole order) 

^ + Fq‘^’’‘‘W“Ndcb - eV“F - F 

N N 

j^ab ^ 

= E{p^ + N^cdh^^) + p^^VbE = E{L^^ + S^^) 

Equations of motion {monopole order) 

M^ = -ag“'’-v‘‘v‘’)^ M = p‘>Vg 
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Table 2 Overview: scalar- tensor equations of motion (Einstein frame) 
Lagrangian and conservation law 

= 2^ {-R + - 2U) + ^LmatCV'. d'P, P~^9ij) 

F = F(ip^) U = U(ip^) -iab = 1ab(v^) 

V'Uj=^{Atij-gijt)d>F 

Equations of motion {any order) 

See Eq.(216) 

Equations of motion {pole-dipole order) 

^ = \R‘‘bcdv’>J‘=‘^ - iV“F-^ - 

Lab ^ 2p[“*l 

r“ = F-‘^p“ + p’’“VhF-‘^ J“’’ = 

Equations of motion {monopole order) 



nonminimal coupling, there seems to be a loophole which may proof to be interest- 
ing for possible experiments; i.e. there is an indirect coupling of new geometrical 
quantities to regular matter via the nonminimal coupling function F . This may be 
exploited to devise new strategies to detect possible post-Riemannian spacetime fea- 
tures in future experiments. 

In addition to the results in MAG, we have explicitly worked out the test body 
equations of motion for a very general class of scalar- tensor gravitational theories. 
Table 2 contains an overview of our main results in the context of this theory class. 
Again the equations of motion at the pole-dipole and monopole order for a general 
coupling function F, which now depends on the scalar degrees of freedom, are 
explicitly given. 

We hope that our covariant unified multipolar framework sheds more light on the 
systematic test of gravitational theories by means of extended and microstructured 
test bodies. We would like to conclude with a statement by Einstein [79] who stressed 
that 

[...] the question whether this continuum has a Euclidean, Riemannian, or any other structure 
is a question of physics proper which must be answered by experience, and not a question 
of a convention to be chosen on grounds of mere expediency. 
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Appendix 

A Conventions and Symbols 

In the following we summarize our conventions, and collect some frequently 
used formulas. A directory of symbols used throughout the text can be found in 



Tables 3, 4 and 5. 




Table 3 Directory of symbols 




Symbol 


Explanation 


Geometrical quantities 


x“,s 


Coordinates, proper time 


9ab 


Metric 




Kronecker symbol 


9 


Determinant of the metric 


a 


World-function 


rab‘^ 


Connection 




Riemannian connection 


rab^ 


Transposed connection 


Rab/ 


Curvature 


Qabc 


Nonmetricity 


Qa 


Weyl covector 


Tab^ 


Torsion 


Kab^ 


Contortion 


Nab^ 


Distortion 


J^abi ^ 


Ricci tensor/scalar 


(<^%)/' 


Generators of general coordinate 
transformations 


y, 


Gravitational Lagrangian (density) 


-.JAM ••■) 


Generalized gravitational field momenta 
(densities) 




Gravitational hypermomentum (density) 




Generalized gravitational energy-momentum 
(density) 


/ 


General action 


L, £ 


Total Lagrangian (density) 




General set of fields 


C" 


(Generalized) Killing vector 




General dynamical currents (densities) 


j 

9iJ 


Jordan frame metric 


J J 
L,£ 


Total Lagrangian (density) in Jordan frame 
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Table 4 Directory of symbols 


Symbol 


Explanation 


Matter quantities 


tp^ 


General matter field 


Ematj '^^at 


Matter Lagrangian (density) 


b <rr b 

5 


Canonical energy-momentum (density) of 
matter 


A"/, 6 “/ 


Canonical hypermomentum (density) of matter 




Metrical energy-momentum (density) 


rab"" 


Spin current 




Dilation current 


ytt 


Eour velocity 


P 


Pressure (hyperfiuid) 


Pa 


Momentum density (hyperfiuid) 


J ^ 

'•a 


Intrinsic hypermomentum density (hyperfiuid) 


F,A 


Coupling function 


pab 


Variation of the coupling function 


ja 


Induced conserved current 




General material currents (densities) 


j -, /•••, m ", p -, k -, 


Integrated moments 


h ", q - , p.-, ^ " 




pab 


Electromagnetic field 


r 


Electric current 


pab ^ab 


Total orbital/spin angular momentum 


pa ^ab 


Generalized total momentum/angular 
momentum 


pab 


Total hypermomentum 


M 


Generalized test body mass 


D 


Intrinsic dilation moment 




Multiplet of scalar fields 


n 


Einstein’s gravitational coupling constant 



For an arbitrary ^-tensor Ta^,„ak^ the symmetrization and antisymmetrization are 
defined by 



T{ai...ak) 



T[ai..Mk] 



1 ^ 

— / . T7ri{ai...ak}^ 
7=1 
^ kl 

~j~\ 

7=1 



( 235 ) 



( 236 ) 
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Table 5 Directory of symbols 


Symbol 


Explanation 


Auxiliary quantities 




Auxiliary variables (Noether identities) 




Partial derivatives of the total Lagrangian 


A) 


Auxiliary Lagrangian density 


0 0 


Partial derivatives of the coupling function 




General multi-indices 




General functions of external classical fields 


Uab/, 


Auxiliary variables (MAG equations of motion) 




Derivative of the coupling function 


f^^^yi-yn’ l^^yi---yn 


Expansion coefficients of the parallel 
propagator 


qubc qubc 


Decomposition pieces of the q-moments 




d 

Trace of the q moment 


J 

lAB, IaB 


General function of scalar fields (in Jordan 
frame) 


J 

U,u 


General potential of scalar fields (in Jordan 
frame) 




Auxiliary variable (Einstein frame) 


(byi-ynyo xuyi-yn+iyoyi 

^ XQ’ ^ -^ 0-^1 


Auxiliary variables multipole expansions 



Operators/accents 





(Partial, Lie) derivative 


Va 


Covariant derivative 


\a 


Riemannian covariant derivative 


Va 


Transposed covariant derivative 


Va 


Covariant density derivative 




Riemannian covariant density derivative 


* 

Va 


Modified covariant density derivative 


D _ 
ds 


Total derivative 


hs) 


Variation, substantial variation 


"B... ••• 


(Bi-)Tensor density 


G^x 


(Generalized) parallel propagator 


“7” 


Jordan frame quantity 


err? 


Riemannian quantity 


“[• • ■]” 


Coincidence limit 



where the sum is taken over all possible permutations (symbolically denoted by 
TTiiai . . .ak]) of its k indices. As is well-known, the number of such permutations is 
equal to k!. The sign factor depends on whether a permutation is even (|7t| = 0) or 
odd (|7t| = 1). The number of independent components of the totally symmetric 
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tensor T(^a\...ak) of rank k m n dimensions is equal to the binomial coefficient 
~ (^ — 1 + ^) !/[k!(^ — 1) !], whereas the number of independent components 
of the totally antisymmetric tensor T[ai...ak\ of rank kmn dimensions is equal to the 
binomial coefficient (^) = n\/[k\{n — k)\]. For example, for a second rank tensor 
Tab the symmetrization yields a tensor T(^ab) = \(Tab + Tba) with 10 independent 
components, and the antisymmetrization yields another tensor T[ab] = ^ {Tab ~ Tba) 
with 6 independent components. 

In the derivation of the equations of motion we made use of the bitensor formalism, 

see, e.g., [7, 8, 80] for introductions and references. In particular, the world-function 

2 



is defined as an integral cr(x, y) 






over the geodesic curve connecting 



the spacetime points v and y, where e = ±1 for timelike/spacelike curves. Note that 
our curvature conventions differ from those in [7, 80]. Indices attached to the world- 
function always denote covariant derivatives, at the given point, i.e. ay := V^cr, 
hence we do not make explicit use of the semicolon in case of the world-function. 
The parallel propagator by g^x y) allows for the parallel transportation of objects 
along the unique geodesic that links the points v and y. For example, given a vector 
at V, the corresponding vector at y is obtained by means of the parallel transport 
along the geodesic curve as y)V^. For more details see, e.g., [7, 8] or 

Sect. 5 in [80] . A compact summary of useful formulas in the context of the bitensor 
formalism can also be found in the appendices A and B of [26] . 

We start by stating, without proof, the following useful rule for a bitensor B with 
arbitrary indices at different points (here just denoted by dots): 



= [B. ,,y] + [B„,,] . (237) 

Here a coincidence limit of a bitensor B {x, y) is a tensor 

[B ]= lim B (x,y), (238) 

determined at y. Furthermore, we collect the following useful identities: 



^yoyixoyixi — ^yoyiyiwi — ^wiyoyiyi’ (239) 

g''^''^CFxiCFx2 =2cr = g^^^^ay^Gy^, (240) 

[a] = 0, K] = [ay] = 0, (241) 

[^^ 1 x 2 ] = [<^yij2] — gyiyi^ [<^^^ 2 ] — [<^yiV2] — ~gyiyi^ (242) 
[c^xix2vs] = [^viX2y3] = = [^yiyiys] ~ (243) 

[^^^yi;v2] = [^^^yiiyi] = (244) 

[/%i;v2V3] = ^^^%iy2y3- (245) 
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B Covariant Expansions 

Here we briefly summarize the covariant expansions of the second derivative of the 
world-function, and the derivative of the parallel propagator: 



/ ^ 1 



— • • • rr 

, . V VO. . Vh I T 



3"2 . . . rryk+\ 






y\yi-yk+\^ ^ 



9^^ x\\x2 — 9^ 



( 1 _ ^1 

-h ^ — 7 



VO , ,, n-y^ . . .rryk+^ 

y'y"y3-yk+2^ ^ 



( 1 ~ oo . 

-py^ / rry^-\-^ —^yo , ^y3,,,^yk+2] 

2 ^ y y2y3^ ^ T y3^2B--3')t+2^ ^ ) 

00 .. 

^ X\\X 2 — ^ X \9 X 2 ^ r^l Y'y"y 3 ...yk+ 2 ^ ^ ’ 



c'-‘x,„=c''y,j:h'" 

k=l 



- 'y ^vf ' • ’(jyky^ 

I / Y y2y3-yk+2^ ^ 



The coefficients a, (3, y m these expansions are polynomials constructed from the 
Riemann curvature tensor and its covariant derivatives. The first coefficients read as 
follows: 



.B dB, 

t Vt Vo Vo _ ( V' 



(BB)yi ’ 



~ {y2y3)y\^ 









(B^ BB)}'!’ 






In addition, we also need the covariant expansion of a vector: 



^ (-1)^ 



4 _ V ’ A 

— g X ^ , I ^yo;yi-yk 



(257) 
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C Explicit Form 



Here we make contact with our notation in [51] to facilitate a direct comparison to 
the results in there. 

We introduce the auxiliary variables 

^yi...ynyo^^ := ^yi . . . ^yn gyo^^^ (258) 

(259) 

Their derivatives 

n 

a=l 

+(j>’i ...(jyn + g^'^xo9^'x'-,z ) . (260) 

n 

a=\ 

+(J^^ ■ ■ ■ (J^'' g^° xQ-z, ( 261 ) 

can be straightforwardly evaluated by using the expansions from the previous 
appendix. 

In terms of (258) and (259) the integrated conservation laws (127) and (128) take 
the form: 

2 j ^y^-y''y°y'x,x'&o’‘'^^d^x2 = 

I 

+ 1 ^y^-y^y^y'xox'-,x"&'>"w^^cn:x2 

+ J vy"+^yi>y'---y"y^y\,x';y„+ie'‘°'‘''‘^di:x2, ( 262 ) 

jJ ^y^-*y\,%^^^^di:x2 = j 



( 263 ) 
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This form allows for a direct comparison to (29) and (30) in [51]. Explicitly, in terms 
of (258) and (259) the integrated moments from (141)-(143) are given by: 



pyi...ynyo 


(264) 


E(r) 




ky2...yn+iyoyi f 


(265) 


E(r) 




l^y2...yn+iyoyi f 


(266) 


E(r) 




y3...yn+2yoyiy2 f ^B...yn+2yoyi^^^^^y2^^0Wix2^v3 


(267) 


E(r) 




py2...yn+iyoyi f 


(268) 


E(r) 




(_i)n f 


(269) 


E(r) 
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Energy-Momentum Tensors 
and Motion in Special Relativity 



Domenico Giulini 



Abstract The notion of “motion” and “conserved quantities”, if applied to extended 
objects, is already quite non-trivial in Special Relativity. This contribution is meant 
to remind us on all the relevant mathematical structures and constructions that under- 
lie these concepts, which we will review in some detail. Next to the prerequisites 
from Special Relativity, like Minkowski space and its automorphism group, this will 
include the notion of a body in Minkowski space, the momentum map, a charac- 
terisation of the habitat of globally conserved quantities associated with Poincare 
symmetry — so called Poincare charges — the frame-dependent decomposition of 
global angular momentum into Spin and an orbital part, and, last not least, the like- 
wise frame-dependent notion of centre of mass together with a geometric description 
of the Mqller Radius, of which we also list some typical values. Two Appendices 
present some mathematical background material on Hodge duality and group actions 
on manifolds. 



1 Introduction 

This contribution deals with the “problem of motion” in Special Relativity. Thus we 
work entirely in Minkowski space M (to be defined below) and represent a material 
system by an energy-momentum tensor T the support of which is to be identified 
with the set of events (points) in Minkowski space where matter “exists”: 



supp(T) := {p G M I T(p) 7 ^ 0}. (1) 
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A central assumption will be that the material system is spatially well localised, which 
here shall mean that supp(T) has compact intersection with any Cauchy hypersurface 
in M. Note that Cauchy hypersurfaces end at spatial infinity /q and that supp(T) need 
not have compact intersection with asymptotically hyperboloidal spacelike hypersur- 
faces which tend to lightlike rather than spacelike infinity. This is depicted in Fig. 1. 

Definition 1 We say that an energy-momentum tensor T describes a body iff the 
intersection of supp(T) with any Cauchy hypersurface in Minkowski space is com- 
pact. □ 

Hence we identify the event-set of a body with supp(T), which, in the sense made 
precise above, is of finite spatial extent, though it clearly will extend to timelike 
infinity. This is visualised as the a tubular neighbourhood stretching all the way from 
past-timelike to future-timelike infinity, as indicated by the shaded vertical tube in 
Fig. 1. It is also clear from Fig. 1 that we generally cannot require compact support 
of T on spacelike hypersurfaces which are not Cauchy, like L. In fact, if the body 
radiated in the finite past, given by the lighter- shaded part of the tubular region in the 







Fig. 1 History of a compact object in the conformal compactification of Minkowski space (Penrose 
Diagram). The five asymptotic regions of Minkowski space are future/past-timelike infinity I± 
(each a single point), future/past-lightlike infinity (each a three-dimensional lightlike manifold 
of topology RxS^), and spacelike infinity Iq (a single point). The representation is not quite faithful 
because spacelike infinity, here represented by two points, is really just a single point. A faithful 
representation is obtained by wrapping the diamond-shaped 2-dimensional figure around a cylinder 
(R X S^), so as to identify both points Iq of the diagram to a single one. S and L are both spacelike 
hypersurfaces stretching out to “infinity”. But only S, which stretches out to spacelike infinity, is a 
Cauchy surface, i.e., covers all of spacetime in its domain of dependence 
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lower half of Fig. 1, the radiation will propagate to and cover a neighbourhood in 
L of its 2-sphere of intersection with which is of non-compact closure. This can 
be avoided for spacelike hypersurfaces ending at /q if we require a neighbourhood 
of 1- to be free of radiation. This means that the body started to radiate a finite time 
in the past and that there is no incoming radiation from arbitrarily close to /q. In 
fact, describing a quasi-isolated body would presumably mean to exclude incoming 
radiation altogether. This explains our motivation for Definition 1 . 

A body should possess globally conserved quantities like linear and angular 
momentum. These are usually written down in a formulae like 



where E is a Cauchy surface, are the components of its future-pointing normal, 
and d/i is the measure on E induced from the ambient spacetime. See [1] for a 
conceptually exceptionally clear discussion. 

The problem with these expressions is that, on face value, they do not make any 
sense. For one thing, the integrands are vector/tensor valued, and adding them at dif- 
ferent points does not result in anything with an obvious meaning. If we wish to inter- 
pret as the a\h (covariant) component of the vector of total linear momentum, we 
should characterise the vector space of which P is an element. And, moreover, what 
does it mean to say that total linear (four-)momentum transforms like a four- vector 
(here covariant)? Likewise, we wish to interpret as the ahih (contravariant) 

component of the antisymmetric 2nd-rank tensor of angular momentum with respect 
to the centre z. Again it is unclear what tensor space this /[z] is an element of and 
what is meant by stating its representation property under Poincare transformations. 
Are these spaces defined at points in spacetime, perhaps at “infinity”, or in an abstract 
vector/tensor space globally associated to (but not in) spacetime? Also, the differ- 
ence (x^ — z^) that appears in (2b) also makes no immediate sense. Is it supposed 
to be the a\h component of some “difference function” on spacetime? Is it supposed 
to make sense in all coordinate systems, or just special ones; and if the latter holds, 
what selects these special ones? 

Clearly, all these questions do have answers, but these answers delicately depend 
on the precise mathematical structures with which spacetime is endowed. In our 
(highly idealised) case of Minkowski space, it is the high degree of symmetry of 
spacetime that allows us to naturally interpret (2) so as to make unambiguous mathe- 
matical and physical sense. Removing or weakening these structures and pretending 
the expressions (2) to still make sense without further qualifications means to commit 
a mathematical and conceptual sin. This does not mean that (2) cannot be meaning- 
fully generalised, but these generalisations will generally not be natural in a mathe- 
matical sense, that is, they will depend on additional structures and constructions to 
be imposed or selected “by hand”. The physical interpretation of what is then actually 




(2b) 



(2a) 
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Fig. 2 As emphasised by 
this conference logo, a 
central problem is to 
associate a timelike curve S 
to the energy-momentum 
tensor T. One would expect 
the line S to lie in the 
“convex hull” of the support 
of T, here represented by the 
extended tube S 




represented by the integrals (2) will delicately depend on these by-hand additions. It 
is therefore the aim of this introductory exposition to clarify the mathematical and 
physical meaning of (2) in the simplest case, i.e. in Special Relativity. My strategy 
will be to fully display all the ingredients that go into the proper definition of (2). 
This, hopefully, will help to distinguish the generic difficulties of the gravitational 
case from those merely inherited from Special Relativity. 

Related to the issue of giving proper meaning to (2) is the definition of “centre 
of mass” of an extended object. As you can see from its logo, this is a central 
concern of this conference (see Fig. 2). If “motion” is the change of position in time, 
we need to be clear about how to define “position” in the first place. The issue of 
how to define position observables in any special-relativistic theory, classical and 
quantum, is notorious. See, e.g., [2] for a good account. In my contribution I will 
give a derivation of the Mpller radius which represents the ambiguity of defining 
position for systems with “spin”, i.e., “intrinsic angular momentum”, a notion also 
to be defined. So let us start at the beginning, asking for the reader’s patience! 



2 Minkowski Space and Poincare Group 

In this section we wish to recall the definitions of Minkowski space and its automor- 
phism group, despite the fact that this is generally considered a commonplace. But 
we think that there are some subtleties, in particular concerning the characterisation 
of its automorphism group, the Poincare group, that deserve to be said more than 
once. We start with 

Definition 2 Minkowski space is a quadruple (M, V, r^, -h), consisting of: 

1. A set, M, the elements of which are called spacetime points or events. 

2. A real 4-dimensional vector space V. 
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3. A simply transitive action of V, considered as a group, on M, denoted by +, i.e., 

M X V ^ M, (p, v) \-^ p V. (3) 

4. A non-degenerate symmetric bilinear form rj G V* ( 8 ) V* of signature 

(+ 1 ,- 1 , - 1 ,- 1 ). □ 

Remark 3 Every non-degenerate bilinear form rj : V x V ^ on a. vector space 

V defines an isomorphism : V V* to its dual space V* via the requirement 

p^(v)(w) := p(v, u;)foralli;, w G y;inshort, i; Pi(v) \= p(v, •)• Us inverse map 
is : V* ^ V , := (j]^) \ which in turn defines a non-degenerate bilinear form 

on the dual space, 7/“^ : V* x V* ^ M, via the requirement /^) •= ct(^t(/^)) 
for all a, (3 g V*. On component-level this reads as follows: Let {ea \ I < a < n] 

be a basis of V and {0^ \ I < a < n] its dual basis of V*, so that O^iet) = 

Then, writing v = v^ea, we get p^(v) = with Vb := v^Pab and pab •= 

V(^a, Similarly, writing a = aaO^, wo get p^(a) = a^ea with := p^^ab 
and p^^ := p~^(0^, 0^). This implies 5^ = p^^pbc = V^^Vcb = ^b particular, 

pab = pa^p^^pcd and Pab = P^^PcaVdb- This explains why p^ and p^ are called the 
operations of “index-raising” and “index lowering”. Sometimes the images of p^ 
and p^ are indicated by the musical symbols (sharp) and b (fiat) respectively, i.e., 
one writes p^(a) = and p^(v) = which makes sense as long as the bilinear 
form p with respect to which these maps are defined is self understood. We shall 
also employ this notation. Note that so far we did not assume p to be symmetric, 
so that all formulae apply generally. However, from now on, and for the rest of this 
paper, the symbol p shall always denote the Minkowski metric, which specialises the 
general case by symmetry and signature. Once p is fixed, the isomorphisms between 

V and y* as well as its extensions to tensor products is clear from the context and 
it is sufficient and useful to use shorthand notations, like v • w := p(v, w) = 

:= V • V, and ||i;|| := V^iTT+J, Given J = ja^ea <S> eb ^ V <S> V and v e V, wo 
shall also write / • f or i; • / for the application of ja^ea p^i^b) = ^ 0^ e 

End(y) or ja^eb p^i^a) = Ja^^b Oa e End(y), respectively, to v. The inner 
products on V and y* can be used to define inner products on any space built by 
taking tensor products of V and y * just by slotwise contraction. However, in certain 
circumstances of high symmetry, e.g., for totally antisymmetric tensor products, it is 
more convenient to renormalise the slotwise inner product by combinatorial factors; 
like in formula (133) of the Appendix. Einally we recall that the transposed of a 
general linear map A : V ^ W between vector spaces V and W is the linear map 
: W* ^ y*, defined by A^(a) := a o A for all a G W*. There is a natural 
isomorphism between a vector space V and its double dual y**, so that we may 
identify these spaces without explicit mention. Symmetry of p is then equivalent to 
pj = p^ and symmetry of 77 “^ to 77 !^ = ^t* 
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2.1 Affine Spaces 

Note that 1-3 define the notion of an affine space. Minkowski space is thus just a real 
4-dimensional affine space, the associated vector space of which carries a Lorentz 
metric. Any vector space V is a group under addition, with group identity being given 
by the zero vector and the inverse of i; g V being — u. It is customary to use the same 
symbol, -h, for the addition of vectors in V and the action of V on M. This allows to 
write the action property in the intuitive form (compare Appendix “Group Actions 
on Manifolds” for the general definition of a group action on a set) 

p -\- (v w) = {p v) w =\ p V w . (4) 

But note the different meanings of -h in this equation. Moreover, we define the 
subtraction of a vector by the addition of the inverse: 

p-v\=p + {-v). (5) 

This allows one more simplifying notation: Since V acts simply transitive, there 
exists a unique i; G V for any given pair (p,^)gMxMso that p = q -\- v. We 
write 



v = p-q. (6) 

Hence the minus sign should be understood as difference map MxM ^ Y,(p,q) 
p — q, defined through p = q + (p — q). Simple transitivity then implies 

(p - o) + (o - q) = p - q, (7) 

which is equivalent to 

p + (q - o) = q + (p - o). (8) 



2.2 Linear and Affine Frames 



Definition 4 A frame F for an affine space (M, V, -h) consists of a tuple F = (o, /), 
where (9 G M and / G Lin(IR'^, V) is a frame of the vector space V. Recall that a 
frame / of an ^-dimensional real vector space V is an isomorphism from to 
V. This is equivalent to choosing n linear independent vectors [e\, ... ,en] C V, 
the images under / of the canonical basis of W . The map / is then defined by 
linear extension: /(r^ , . . . ,r^) = X«=i Us inverse map is given by f~^(v) = 
. . . , where {0^, . . . ,0^} C V* is the dual basis of {^i, . . . , en}, i.e., 

0^(et,) = S^. Similarly, an affine frame F defines a bijective map between and 
the underlying set M, denoted by the same letter F and defined by 
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F : 



M, 



(ri, . . . , r„) F(ri,...,r„) ; = o + /(r\ . . . , r") 



= o + 'y'r^eg 

< 1=1 



(9) 



The inverse map is 

: = f-\p - o) 

= [e^{p-o),...,9’^{p-o)). (10) 

Given two frames F = {o, f) and F' = {o' , f'), they are related by F = F' o 
(F'~^ o F), where 

F'“* o F : R" ^ R" , (r, (11a) 

with 

r'\r\ . . . , r") = [f~\o - o') + o f{r\ r")f 

n 

= e'‘'(o-o') + Y,e'\eb)r‘’. (11b) 

b=l 

We denote the set of all affine frames of M by ^m- Q 

Remark 5 Affine spaces naturally inherit a topology from W . It is defined to be 
the unique topology on M for which all frame maps (9) are homeomorphisms, i.e., 
F and F~^ are continuous (hence F is an open map). Note that if a particular 
F' is a homeomorphism, than so is any other F, for F = F' o (F'~^ o F) and 
F'~^ o F ^ W, given by (11b), is clearly a homeomorphism. Hence the open 
sets in M are precisely the images of open sets in under any F. Moreover, affine 
frames endow M with the structure of a smooth (C^ , or even analytic) manifold since 
each frame defines a global chart with analytic transition functions (11b) between 
those charts. □ 

Definition 6 Affine frames define special, globally defined coordinates which are 
called affine coordinates or, in a physical context, inertial coordinates. Using these 
we may regard affine spaces as smooth (C^ , or even analytic) manifolds, as explained 
in Remarks. 

Recall that the algebra of all linear self-maps of a vector space V onto itself is 
denoted by End(V) (endomorphisms). The subset of all invertible elements in End(V) 
is called GL(V); it forms a group, the general-linear group (of self-isomorphisms, or 
Automorphisms regarding its structure as vector space) of V. Accordingly, End(M'^) 
is just given by the algebra of all real n xn matrices and GL(M”) by the group of all 
n X n matrices with non- vanishing determinant. 
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A frame of V defines an isomorphism of algebras End(V) ^ End(M'^) through 
:=/oAo/“^.Its restriction to GL(V) defines an isomorphism of 
groups GL(V) ^ GL(M^). Let us denote by Ty the set of all frames of V. There 
are two natural left actions of groups on Ty\ GL(V) acts on the left according to 
(A, /) A o / and GL(M'^) also acts on the left according to {B, /) / o . 

Note that {B, f) \-> f oB would be a right action; see Appendix “Group Actions on 
Manifolds” for a general discussions of group actions. Both actions commute and are 
each simply transitive. A combined left action of GL(V) x GL(IR^) on according 
to ((A, B), f) Ao f o B~^ results. The action of GL(M'^) is sometimes called 
passive since it merely moves the labels (coordinates) in label-space W, whereas 
GL(V)’s action is called active since it really moves the points in the space V. Note 
that these adjectives refer to different groups, which are isomorphic but not naturally 
so since picking any isomorphism requires extra choices to be made. Eor example, 
picking a frame /, an /-dependent isomorphism GL(IR'^) ^ GL(V) is defined 
through the stabiliser subgroup in GL(V) x GL(M'^) that fixes / under the common 
left action just described. This isomorphism then simply reads GL(M'^) 3 B \-^ A := 
f o B o f~^ G GL(V) (so that A o / o B~^ = /), which then also defines a frame- 
dependent left action of GL(M^) on V. With respect to the fixed frame / the latter 
can then be used to define “active” transformations on V by means of what previously 
had been interpreted as mere label (coordinate) transformations. Tailing to clearly 
state the groups, their domains of action, and the structures to be considered fixed is 
often the source of considerable confusion regarding the distinction of “active” and 
“passive” actions. 



2.3 Affine Groups 

Definition 7 Let (M, V, +) be an ^-dimensional real affine space. The affine group, 
denoted by Aff(M), is the group of automorphisms of (M, V, +). This means that 
Aff(M) is the subgroup of bijections of M preserving the simply transitive action 
V on M. The word “preserving” means that for each H g Aff(M) there exists a 
unique h G Aut(V) so that H{p -h i;) = H{p) -\- h{v) for all G M and all i; G V. 
Here Aut(V) is the automorphism group of V, which is GL(V) if we consider its 
structure as vector space or as topological group, i.e., GL(V) are the continuous 
automorphisms of the topological group V. 

Aff(M) ;={//: M ^ M I H{p + i;) = H{p) + h{v ) , h e GL(V) , Vv e V}. 

( 12 ) 



Note that this definition makes sense, for if p' -\-v' = p + v,or ip' — p)-\-v' = n, we 
have H{p' + n') = H{p') + h(v') = H{p + {p' - p)) + h(v') = H{p) + h{(p' - 
p) + v^) = H{p) + h{v) = H{p + v). □ 
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Remark 8 We said that Aut(V) is GL(V) if we consider V either as vector space or 
as topological group, comprising all the continuous automorphisms in the latter case. 
This qualification is indeed necessary, for if we considered V merely as algebraic 
group, as it might seem sufficient at this point, Aut(V) would indeed be very much 
larger than GL(V) in that it will also contain all the wildly discontinuous automor- 
phisms that V inherits from the likewise wildly discontinuous automorphisms of the 
algebraic group (M, +). The latter are the discontinuous solutions / : M t-^ M to the 
so-called Cauchy functional equation, f{x-\-y) = / (v) -h / (y ) , which are also bijec- 
tions. It is elementary to show that all its solutions necessarily satisfy f{qr) = qf{r) 
for all ^ G Q and all r g M. This implies that f{q) = ^/(l), i.e., that / is linear 
with slope c \= /(I) on all rational numbers, and hence linear with slope c on all 
real numbers if / were required to be continuous (requiring continuity at one point 
is sufficient). Without requiring continuity we can only conclude that for fixed r g M 
and all ^ G Q we must have f{rq) = rq (/(r)/r), i.e., that / is again linear on the 
r -multiples of the rationals, but now with possibly r -dependent slope c(r) := /(r)/r. 
Indeed, plenty of such discontinuous solutions exist and can be constructed as fol- 
lows [3]: Consider R as vector space over Q and let 5 c R be a (Hamel) basis, 
i.e., for each r g R there exists a unique finite subset {e\, ... ,en) C B and unique 
{qi, ... , qn) G Q^, such that r = As was shown in [3], the existence 

of such a basis follows from the well-ordering theorem, though the cardinality of 
B is that of R, i.e., the basis is uncountable. Now, any bijection f \ B ^ B gives 
rise to an element of Aut(R, +) by uniquely extending / from B C R to R in a 
Q-linear fashion, i.e., by setting fC^qtei) := ^qifict) for all finite linear com- 
binations of elements in B over Q. Moreover, if the initial permutation f : B ^ B 
is not linear, i.e., if the function B 3 e \-> f(e)/e is not constant, the automor- 
phism / : R ^ R so defined is “wildly” discontinuous, in the sense that its graph 
{(x, f(x)) I X G R} C R^ is dense! In particular, given any x G R, the image of 
any interval containing x under / is dense in R, no matter how small the intervall 
was chosen to be. To see this, consider e\,e 2 e ^ so that f(ei)/ei / /(^2)/^2- 
Given any (x, y) G R^ we can uniquely solve the two equations x = r\ei -h T2^2 
and y = rif{e\) -h r 2 f(e 2 ), i.e., the single linear equation. 



for (ri, T 2 ) G R^ by rational operations, since the 2x2 matrix in (13) is invertible. 
In particular, (x, y) depends continuously on (ri , T2) so that with rational (q \ , ^ 2 ) ^ 
in a neighbourhood of (ri, r 2 ) we get arbitrarily close to (x, y), as was to be 
proven. All this implies that the usual abelian group structure underlying vector 
addition cannot be uniquely specified without requiring continuity. Interestingly this 
problem was first encountered in analytical mechanics in connection with attempts to 
mathematically characterise the law for the composition of forces [4] and only later 
recognised as essential for general axiomatic formulations of vector addition; see, 
e.g., [5]. For us all this means that we cannot avoid invoking a continuity hypothesis 




(13) 
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and that we must regard the abelian groups whose simply transitive action we require 
in the definition of affine spaces as topological groups acting continuously on affine 
space with its natural topology inherited from M”; compare Remarks. One might 
think that one gets away without continuity requirements if one defines Aff (M) as that 
subgroup of the group of bijections (no continuity required here) of M which maps 
straight lines (physically: inertial trajectories) into straight lines (collinear sets of 
points into collinear sets would also suffice). A classic result in affine geometry then 
tells us that such transformations necessarily coincide with the standard continuous 
affine transformations; see, e.g., [6]. However, here a continuity requirement has 
tacitly slipped into the notion of “straight line” (inertial trajectory), which in affine 
space is defined to be the orbit of a continuous one-parameter subgroup of V. □ 

Coming back to the group of affine automorphisms as defined above, we see that 
Hence an element H g Aff (M) is uniquely specified by an ordered pair of points 
ip,q) G M X M and an element h g GL(V). The second point q is regarded as 
the image of the first point p under the map in question, whose definition is now 
given by H{p -hi;) q -\- h(v). Two such maps, H and H' , characterised by 
(p, q, h) and {p' , q\ h'), respectively, are easily seen to be the same iff h = h' and 
q' — q = h{p' — p) . This defines an equivalence relation on the set M x M x GL(V), 
the equivalence classes of which are 

[p, q,h]= \^(^p + v,q h(v), h^. (14) 

veV 



Hence we may identify Aff(M) with this quotient space and write H = [p,q,h] for 
any H g Aff (M). The composition of two maps H = [p,q,h] and H' = [p^ q' , h'] 
can then be calculated 

H' o H{p + i;) = H'{q + h(v)) = H'{p' + (q - p') + h{v)) 

= q' -\- h'iq — p') + h' o h{v). (15) 



In other words 



[p', q', h'] o [p, q, h] = [p, q' + h'(q - p'), h' o h]. (16) 

The first thing to note is that the equivalence class on the right-hand side is 
unchanged if we replace (p,q,h) with (p -h v,q h(v),h) or (p',q\ h') with 
{j>' -h v' ,q -h h'(v'), h'), which means that this prescription written down in terms 
of representatives defines indeed a multiplication of equivalence classes. Note that 
the neutral element is [p, p, idy] and the inverse of [p,q,h] is 

[p, q, h]~^ = [p,p- h~^{q - p), /?“*]. (17) 

Furthermore, it is easy to check that (16) is associative and hence defines a group 
multiplication. 
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An obvious subgroup in Aff (M) is given by the following subset 

Trans(M) := {[p, q, h] e Aff(M) | h = idy}. (18) 

This subgroup is abelian, 

[p\ q', idy] o [p, q, idy] = [p, q' + (q - p'), idy] 

= [p' + ip-p'),q' + (q-p'),idv] 

= [p',q' + (q - p') + (p' - p), idy] (19) 
= [p', q' + {q - p), idy] 

= [p\ q + (q' - p), idy] 

= [p,^,idy] o [p',q', idy], 

(using (7) and (8) at the fourth and fifth equality) and normal, 

[p, q, h] o [p', q', idy] o [p, q, h]~^ 

= [p, q, h] o [p', q', idy] o[p, p - h~^(q - p), h~^] 

= [p, q, h] o [p, q' + (p- p') - h~Uq ~ p), h~^] (20) 

= [p,q+ h{q' - p') + {p- q), idy] 

= [p, p + h(q' - p'),idy]. 

It is called the subgroup of translations. It is the kernel of the projection homomor- 
phism 

7T : Aff (M) ^ GL(V) , [p,q,h]h^ T:{[p,q,h]) := h. (21) 

If we denote the embedding (injective homomorphism) of Trans (M) into Aff(M) 
by /, we have the short sequence of groups and maps 

{1} ^ Trans(M) ■4 Aff(M) ^ GL(V) ^ {1}. (22) 

Here {1} stands for the trivial group with unique group homomorphisms from and to 
any other group. The hooked arrow and the harpoon indicate injective and surjective 
homomorphisms respectively. This may be briefly summarised by saying that the 
short sequence is exact, where exactness means that at each group the image of the 
arriving map is the kernel of the departing one. 

Moreover, our sequence (22) is not only exact but it also splits. By this is meant that 
there are also group embeddings (injective homomorphisms) j : GL(V) ^ Aff(M) 
so that 7 T o 7 = idGL(V) • To see this, choose a point o eM and define (indicating the 
dependence of j on (9 by a subscript) 



jo : GL(V) ^ Aff(M) , h ^ jo(h) := [o, o, h]. 



(23) 
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Since [o, o, h'] o [o, o, K\ ~ [o,o h\o — o), h'h] = [o, o, h'h] one has indeed 
io(h')io(h) = ioQi'h) and /o(idGL(V)) = idAff(M), that is, io is a group homomor- 
phism. But note that we needed to select a point <9 g M to define the embedding. 
Two embeddings corresponding to different choices o and o' are related by conjuga- 
tion with the translation from o io o' . Indeed, using that according to (17) we have 
[(9, (9^ idvl“^ = [(9, (9 — {o' — o), idvl = [o', o, idvl, we have for all h e GL(V) 

[o, o' , idvl o ioQi) o [o, o' , idy]”^ = [o, o', idy] o [o, o, h] o [o', o, idy] 

= [o, o', idy] o [o', o, h] (24) 

= [o\o',h] 

= io'ih). 

The relation between the three groups Trans (M), Aff(M), and GL(V) can then be 
compactly expressed by completing the short exact sequence (22) by a splitting 
homomorphism jo'. 



7T 

{1} ^ Trans(M) <4 Aff(M) ^ GL(V) ^ {1}. (25) 



This characterisation in terms of a split exact- sequence is the most natural in view of 
the homogeneity of M. The usual characterisation by means of a semi-direct product 
V X GL(V) is unnatural insofar as the GL(V) subgroup in Aff(M) depends on the 
choice of a point o e M, violating homogeneity. What one may say is that Aff(M) 
is isomorphic to V x GL(V), but the isomorphism depends on the selection of a 
point. Only after the point is selected can we locate a linear subgroup in Aff (M) 
isomorphic to GL(V), namely the image of GL(V) under the embedding jo (23). 
Once one agrees to select a point o e M, we may write the general element of 
Aff(M) in the form [o,q,h]. Group multiplication according to (16) then becomes 

[o, q' , h'^ o [o, q, h] = [o, q' -h h'(q — o), h' o h] = [o, o + {q' — o) + h' {q — o), h' o h]. 

(26) 

Having selected o we may identify M with V via p \-^ p — o (sometimes called the 
“vectorialisation” of M at (9 [6]) and the group Aff(M) with the set V x GL(V). A 
general group element may then be written [o,o-\-v,h] \-> (v,h) and (26) becomes 

{v', h') o {v, h) = [v' -h h'{v), h' o h), (27) 

which is just the product structure of a semi-direct product V x GL(V) with respect 
to the homomorphism GL(V) ^ Aut(V) that is given by the defining representation 
ofGL(V). 

Remark 9 The proper statement regarding the structure of the affine group Aff (M) 
is that it is a downward splitting extension of GL(V) by Trans(M), as summarised by 
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(25). To be a downward extension^ means that Trans(M) is a normal (or “invariant”) 
subgroup of Aff (M) so that the quotient Aff (M) /Trans (M) is isomorphic to GL(V). 
To be “splitting” means that GL(V) may be identified with a subgroup in Aff(M) 
whose intersection with Trans (M) is merley the group identity. In our case there exist 
many such splitting embeddings of GL(V) into Aff(M), so that there is no unique 
way to regard GL(V) as subgroup of Aff(M). The ambiguity is faithfully labelled 
by the points in M (the point that is fixed under the action of the embedded copy of 
GL(V) in Aff (M) on M). Given such a splitting, Aff (M) becomes isomorphic to the 
corresponding semi-direct product V xi GL(V). But this isomorphism depends on 
the choice of a point in M. If one says that Aff (M) is isomorphic to V x GL(V) one 
should add that this isomorphisms is not “natural”, since by the very homogeneity 
of M there is clearly no preferred choice of a point in M. 



2 A Poincare Group 



Definition 10 Given Definition 2 of Minkowski space, we define the Poincare group, 
Poin(M), to be its group of automorphisms. This means that is must consists of affine 
transformations including all elements in Trans (M), such that Aff (M) /Trans (M) = 
Lor(V) C GL(V), where 

Lor(V) := |/j e GL(V) | r]{h{v), h{w)) = r}[v, w) , Vi;, w e vj. (28) 
Hence we have 

Poin(M) := {H : M ^ M I H(p + v) = H{p) + h{v), h e Lor(V) , Vv e V}. 

(29) 

□ 

Totally analogous to (25), this leads to the splitting exact sequence 

7T 

{1} ^ Trans(M) Poin(M) ^ Lor(V) ^ {1} (30) 



and to the (9 G M dependent(!) isomorphism 



^Here we recall that the usual terminology regarding extensions of groups is not quite uniform and 
hence ambiguous. Suppose three groups H, E and G are related by an exact sequence 1 ^ H ^ 
E -> G -> 1, i.e. that // is a normal (or “invariant”) subgroup of E with quotient E /H isomorphic 
to G. Then this state of affairs is usually simply expressed by either saying that E is “an extension” 
of G by //, 6>r of // by G. This ambiguity arises because views differ as to whether one likes to 
regard the extending or the extended group to be that one which becomes normal in the extension. 
To avoid such ambiguities the following refined terminology has been proposed in [7] : E is called 
an upward extension of // by G, or a downward extension of G by //. 
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Poin(M) = y X Lor(V). 



( 31 ) 



If we complete 6> to a full affine frame F = (o, f), where / g Lin(IR'^, V), and if 
in addition we require / to map the standard basis of W to the orthonormal basis of 
V with respect to r], i.e., r]ab •= ^b) = ~^^ab with one plus and n — I minus 

signs), we may identify M with and then have 



Now, Poin(M) is a Lie group. The structure of a differentiable manifold with 
respect to which all group operations become smooth are again obtained by its 
isomorphism (non-naturalness is irrelevant here) with the matrix group just described. 
Note that the semi-direct product (32) can itself be embedded (i.e. mapped by an 
injective homomorphism) into the group GL(M'^+^), via 



which endows it with the differentiable structure inherited from GL(M'^+^). All this 
is using the preferred affine (or inertial) coordinates of M; compare Definition 6. 
Note also that the group multiplication in Aff(M) has been explained simply by 
composition of maps (Aff(M) was defined to consists of special bijections of M). 
This defines a left action of Aff (M) on M and hence, by simple restriction, a left 
action of Poin(M) on M. This, in turn, defines an anti-homomorphism between the 
Lie -algebra of Poin(M) and the Lie algebra of vector fields on M (considered as 
differentiable manifold), where the Lie-algebra structure of the latter is defined by 
the commutator of vector fields. The reason why we have an anti- rather than a proper 
homomorphism of Lie algebras is explained in detail in Appendix “Group Actions on 
Manifolds”, in which we also review in some detail the notion of Lie-group actions 
on manifolds. 

We recall that the Lie algebra, lor(V), of Lor(V) is the linear space of endomor- 
phisms A g End(V) which are antisymmetric with respect to the Minkowski inner 
product T], i.e., satisfy r](Av, w) = —r](v, Aw) for all i;, u; G V. Using the r^-induced 
isomorphism r/; : V ^ V* and its inverse (compare Remarks), we can then write 
down the projection operators Ps, Pa • End(V) ^ End(V), which project onto the 
7^-symmetric and 7^-antisymmetric endomorphisms: 



Poin(M^) = X Lor(M^). 



(32) 



where 



Lor(M") := {L e GL(7?") | = rj,d}- 



(33) 




(34) 



Ps(M) := ^{M + t]^oM^ ot]A 
Pa(M) := otjA 



(35a) 

(35b) 
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Hence lor(V) can be either characterised as the kernel of Ps or the image of Pa in 
End(V). Using the first option we may write 

[or(V) = Kcr(Ps) = {A e End(V) | A = o o (36) 

Using the point-dependent isomorphism (31), the Lie algebra of Poin(M), denoted 
by poin(M), is the semi-direct product of the Lie algebras V and lor(V). Note that 
V, considered as abelian group, has a Lie algebra which is isomorphic (as vector 
space) to V with trivial Lie product (i.e. all Lie products are zero). Then we get the, 
likewise point-dependent, isomorphism 

poin(M) = V X lor(V) 

= |(i;, M) e V X )ot(V) | [(v, M), (w, A^)] = {Mw - Nv , [M, yV])|. (37) 

Here Mw is the action of M e End(V) on w e Y and [M, A^] is the commutator, 
which turns End(V), considered as associative algebra, into a Lie algebra. An easy 
way to see that (37) does indeed give the right Lie product is to use the embedding 
(34), which induces an embedding 

M" X [or(R") 9 (i;, M) ^ e End(R"+‘). (38) 

The Lie product of the images of (i;, M) and (w, N) is then just their commutator, 
which is immediately seen to be the image of (Mu; — Nv, [M, A]). 

Now, as already mentioned above, the left action 

O : Poin(M) x M ^ M , (g, m) \-^ <^(g, m) = 0^(m) (39) 

of Poin(M) on M induces a linear map from poin(M) to the linear space of vector 
fields on M, denoted by Vec(M) (smooth sections in TM). This map is just the differ- 
ential of O with respect to the first (group valued) argument evaluated at the group 
identity. This is explained in all detail in Appendix “Group Actions on Manifolds”; 
compare (154). Since Vec(M) is itself a Lie algebra, where the Lie product is defined 
to be the commutator of vector fields. With respect to these two Lie structures, the 
linear map poin(M) 3 X \-^ G Vec(M) is a Lie anti-homomorphism. Again we 
refer to the Appendix “Group Actions on Manifolds” for details; compare (172b). 
Hence we have 

[v\ v^] = ( 40 ) 

where the “anti” is reflected by the minus-sign on the right-hand side. 

Moreover, as the left action of Poin(M) on M lifts by push-forward (differential 
of O with respect to second (M- valued) argument) to a left action on TM and hence 
Vec(M), we can ask for the result of acting with g e Poin(M) on the special vector 
field V^. The result is [see Eq. (173a) of Appendix “Group Actions on Manifolds”] 
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= O^g. (41) 

where Ad denotes the adjoint representation of Poin(M) on poin(M). 

Let us at this point say a few words about the adjoint and co-adjoint representation; 
the latter will become important in what is to follow. An easy way to calculate 
the adjoint representation is again to identify Poin(M) and poin(M) according to 
(32) and (37), respectively, and perform the easy conjugation-calculation using the 
embeddings (34) and (38). The result is 



Ad(a,L)(i;, M) = (Lv - LML~^v, LML“*). (42) 

The co-adjoint representation is the usual representation induced by Ad on the dual 
space, that is, the inverse transposed. As a vector space, poin(M) is isomorphic to a 
linear subspace of V 0End(V), namely the image of idy 0 Pa • Note that V 0End(V) 
may be identified with V 0 (V 0 V*). The dual of the vector space poin(M) is then 
isomorphic to a subspace of the dual to V0 (V 0 V*), i.e., a subspace of V* 0 (V* 0 V) . 
This subspace is the image of idy* 0 P~[. It is called the dual of the Lie algebra 
poin(M), denoted by poin*(M). It is merely considered as a vector space, not a Lie 
algebra. The natural paring between (p, J) g poin*(M) and (u, M) g poin(M) is 

[{p, 7)](i;, M) = p(v) + iTr(/^ o M). (43) 

The factor 1/2 in the second term is introduced because M obeys the condition 
Ps(M) = 0 and each independent component of M contributes twice to the trace. 
We have, by definition of the transposed map, Tr(/^ o PaM) =: Tr((P^ /)^ o M) 
and likewise for Pj, which immediately leads to the expressions 



Pj{J) := +r,^oJ^ op,). 


(44a) 


Pj{J) := -p,oJ^ op,). 


(44b) 


Hence we may characterise lor*(V) by: 




[or*(V) = {7 e End(V*) \ J = -p, o o p,}, 


(45) 


and furthermore (as vector spaces) 




poin*(M) = V* X [ot*(V). 


(46) 



As already said, the co-adjoint representation. Ad* of Poin(M) on poin* (M) is defined 
to be the inverse-transposed: 



Ad* : Poin(M) x poin*(M) ^ poin*(M) 

((a, L ) , (p, J)) Ad*^_^)(;>, /) := (p, J) o Ad^|^. 



( 47 ) 
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Note that the inverse is necessary to get a left action, i.e., Ad*^ o Ad*^/ = 

Using 

Ad“|^)(u, M) = {L-'-v + L“^Ma , L“^ML), (48) 

a straightforward calculation gives, writing L := {LJ)~^ and using the identity 
p{w) = Tr(n; (g) p), valid for any w eY and g V*, 

+ lTr([2Lp ® a + LjL~^Y My (49) 

This implies 



^^*a,L)(P^ = [PP’ LJL~^ + pj (2 Lp® a) 

= {Lp, LJL-^ + Lp®a-a'^ ®(Lp)^). (50) 

For what follows it is important to compare the adjoint representation (42) of 
Poin(M) on poin(M) with the co-adjoint representation (50) of the same group on 
poin*(M). This is not quite straightforward since the representation spaces are dif- 
ferent and hence it is not entirely obvious how to best appreciate their difference. 
However, it is true that, as vector spaces, poin(M) and poin*(M) are isomorphic, 
though not naturally so. We need an extra structure to select a specific isomorphism, 
which in our case is already given to us by the inner product 77 , which was already 
seen to give an isomorphism 77 ; : V ^ V*; compare Remarks. This structure can 
clearly also be used to define an isomorphisms poin(M) ^ poin*(M). However, it 
is more convenient to define isomorphisms between each of these vector spaces and 
V 0 /\^ y, where /\^ V := V A y is the antisymmetric tensor product: 

poin(M) = V©Pa(V 0 V*) = V©(VaV) = V*©P/(V* 0 V) = poin*(M). (51) 

Indeed, note that under this isomorphism lor(V) gets mapped isomorphically onto 
the antisymmetric subspace /\^ V C V V. The corresponding representations 
on y 0 /\^ y, which are equivalent to Ad and Ad* under these isomorphisms, are 
respectively given by 



M) = [Lv — (L <S> LM) • a , L <S) LM), (52a) 

AJ*^ q(P, J) = {Lp , L <S> LJ — a A Lp), (52b) 

where the dot now abbreviates the inner product 77 in y, as explained in Remarks. 
So for a, b, c eY we write 



(a Ab) ' c = {a^b — b ^a) ' c = a p(b, c) — b p{a, c) =: a(b • c) — b(a • c). (53) 
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These are now two inequivalent representations of the same group on the same vector 
space. It is the second, co-adjoint representation that will be physically relevant. It 
differs from the adjoint representation on how it implements the normal subgroup 
of translations. Let us, for clarity, just display the two representations if restricted to 
the subgroup Trans (M): 



= {y — M • a , M), (54a) 

J) = {p , j - a A p). (54b) 

The obvious difference is that under the adjoint representation translations act non- 
trivially only on the first summand in V 0 (V A V) under the adjoint-, and non- 
trivially only on the second summand under the co-adjoint representation. As we 
will see below, the latter corresponds to the familiar origin-dependence of angular 
momentum and origin-independence of linear momentum. 

Finally, using the identification LiePoin(M) = V x (V A V), let us explicitly write 
down the Lie algebra (37) in terms of a basis. Let {Ca \ I < a < n} he sl basis of V, 

such that T](ea, e^) = ea • = rjab then {mab I l<< 2 <Z?<^}isa basis of V A V, 

where ruab := Ca A eb = {ea® eb — eb ® Ca). Then the Lie products in (37) become 



[ea,eb] = 0, (55a) 

{ea. mbc] = Vab ec ~ Vac eb, (55b) 

[f^ab^ ^cd\ — Vad ^bc H” Vbc ~ Vac f^bd ~ Vbd f^ac- (55c) 



3 The Momentum Map and the Natural Habitat of Globally 
Conserved Poincare- Charges 



We now regard Minkowski space as a Semi-Riemannian manifold (M, g) with 
Lorentzian metric g, which in affine/inertial coordinates (compare Definition 6) is 
of the form (in four spacetime-dimensions) 

g = Pab dx‘' ® dx^ , {?7afo) = diag(l, -1, -1, -1). (56) 

As discussed above, and in more detail in the Appendix “Group Actions on 
Manifolds”, for each X g poin(M) we have a vector field g Vec(M) that 
represents the “infinitesimal” left group-action of Poin(M) on M through an anti 
Lie-homomorphism po in (M) ^ Vec(M), X i-^ V^, satisfying (40). Since Poin(M) 
acts on M by isometries, the Lie derivative of g with respect to each is zero: 



L yx g — 0. 



(57) 
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In other words, each e Vec(M) is a Killing vector-field. 

Now, suppose we have an energy-momentum tensor 

T = Tab dx‘' (g) dx^ (58) 



which is divergence free with respect to the Levi-Civita covariant derivative deter- 
mined by In components with respect to arbitrary coordinate systems this reads 



VaT‘^'’ = daT^’’ + + ^'’acT‘‘" 



= 0 . 



(59) 



If the coordinates are affine/inertial, the F -coefficients are all zero. 

Another way to look at T is to regard it as a co- vector valued 3 -form. This is 
achieved by Hodge-dualising the second tensor factor in (58): 

T =Tabdx^ ®{*dx^), (60) 



where ★ is the Hodge duality map the definition of which, together with our conven- 
tions, are summarised in Appendix “Exterior Products and Hodge-Duality”. Now 
comes the important point in the whole construction: using the vector fields V^, we 
can, for each X g poin(M) turn (60) into a 3-form that linearly depends on X via 

Tx := = {V^YTab i*dx^) = {V^f 9ab ^ecdejdx‘‘ ^dx^ ^dx^ . (61) 

here iy denotes the map of inserting V into the first co- vector factor of the tensor it 
is applied to and Cabcd are the components of the measure 4-form induced by g. The 
zero-divergence condition (59) implies, in view of (57), that each Tx is closed: 



dTx = 0. (62) 

This means that to each X e poin(M) we can produce a number by integrating Tx 
over a 3 -dimensional hypersurface: 

m[F,S](X):= [ Tx[F]= [ ★o/^xoT[F]. (63) 

Js Js 

Here we wrote the integrand as a composition of three maps. The first (T) maps 
the field configuration E to a symmetric tensor, the second iyx contracts this tensor 
with the vector field and turns it into a one form, and the last (★) turns this 
one form into an ^ — 1 form (a three-form in four dimensions). The last map to 
be applied in order to get a number is to integrate this form over a hypersurface S. 
This number will depend on three arguments: The fields F on which T depends, the 
surface S over which we integrate, and the Lie algebra element X which we use to 
build y ^ to contract T with. The value 971 takes on all these arguments is called the 
corresponding momentum. 
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Suppose now that the fields F on which T depends carry a representation (not 
necessarily a linear one) of Poin(M). That is, we assume there is a left action D of 
Poin(M) on the space (not necessarily a vector space) of fields. We assume that the 
geometric object T is built entirely out of such fields, and that there is no dependence 
on any other geometric structure not included in our F . Then we have the covariance 
property^ 

T[DhF] = <t>h^T[F] (64) 



where O is as in (39) and denotes the push-forward of the diffeomorphism 
: M ^ M. If F denote standard scalar, vector, and tensor fields, then (64) 
merely says that the energy-momentum distribution of the pushed-forward fields is 
just the push-forward of the energy-momentum distribution of the original fields. 

Now we are interested in how the momentum changes if we act on the fields F by 
a Poincare transformation, leaving the arguments S, X untouched for the moment. 
We get: 



TO[D/,(F),5](X) = 

5 



★ O /yZ o r O Dk [F] 



L 



yX O o T [F] 



★ oi.-lyx oT [F] 



★ o o i 



^Ad^_lW 



oT[F] 



L 
I 

oT [F]^ 

j (*o / Ad i(X) o F [F]) 



= OT[F,cD^-,(5)](Ad^‘m) 

I Ad^(OT)[F,<I>;,-i(5)](X) 



(65) 



Here we broke up the derivation into seven steps, each one showing what happens 
as we commute the action of Poin(M) from right to left through the various maps 
connected by the o symbols. At the first step we use (64), at the second step we just 
use the obvious commutation property of push-forwards with the vector-insertion 
map, at the third step we use property (41), at the fourth step we use the covariance 
(intertwining property) of the Hodge map and the definition of the push-forward of 
a form as the pull-back by the inverse map, in the fifth step we use the elementary 



^This covariance property, which is crucial for the right representation-theoretic properties of the 
global charges, is hardly ever stated explicitly. A notable exception, more in words than in formulae, 
is Fock’s book [8] Sect. 31, where it is referred to as “physical principle”. 
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property of integrals, sometimes referred to as the “change-of-variables-formula”, 
in the sixth step we just use the definition (63), and in the seventh and last step we 
use the definition (47) of the co-adjoint representation. 

Now, if 5 is a Cauchy surface and the support conditions discussed initially are 
satisfies, we are ensured that the integral converges and the momentum actually 
exists. Moreover, if T is divergence free, as we assume here, the momentum does not 
depend on the particular Cauchy surface chosen, as follows from (62) and Gauss’ 
theorem. Hence we may delete S as an argument of 971 . Since Eq. (65) is valid for 
all X e poin(M), we may also delete the dependence on X, which is linear. We 
can then and regard (65) as an equation between elements in the dual of the Lie 
algebra depending merely on F and expressing the fact that they transform under the 
co-adjoint representation. 

Theorem 11 A divergence-free energy -momentum tensor describing a body in the 
sense of Definition 1 and depending on fields which carry a (not necessarily linear) 
representation D of Foin(M) defines a map from the space of field configurations to 
poin*(M), called momentum map, given by 

m(X):= j^Tx[F], (66) 

where S is any Cauchy surface. The map is Ad*-equivariant in the sense that 



dJl oDh=AdloWl, (67) 

for all h e Poin(M). □ 

Let us finally see how, and in what sense, the general formula (66) implies the naive 
expressions (2). Lor this we express V ^ in affine/inertial coordinates and choose for 
X basis elements of poin(M) that are adapted to the decomposition of poin(M) as 
semi-direct product V x lor(V). But here comes the point stressed above: there is 
no natural identification of V x lor(V) with poin(M). Any such identification is 
equivalent to the choice of a point G M. Only with respect to the choice of such a 
point does it make sense to speak of Lor (V) as a subgroup of Poin(M) and of lor(V) 
as a Lie subalgebra of poin(M). 

Let us now choose a system of affine/inertial coordinates so that the vector 
fields djdx^ are orthonormal [i.e. the Minkowski metric g takes the standard form 
(56)] . The coordinate values of the preferred point o is denoted by := x^ (o) . Then 
for X = (v, M) e V © (V ® V) is 

y(r.iW)(2) ^ 1 7]cb[(x‘' ~ z“)d/dx^ - (x^ - Z^)d/dx“]. (68) 

Note that x^ : M ^ M are coordinate functions on the manifold whereas = x^ (o) 
are fixed numbers (constant functions on M). The corresponding momentum is then 

[Z = {V, M)] = llabV^P’’ + lllacVbdM^^^ 



(69) 
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where, just as in (2), 



pa 



r\z\ 







(70a) 

(70b) 



Here u is the unit timelike normal to S and d/i = (the Hodge-dual of the one-form 

:= T]^{u)) is the induced measure (3-form) on S. Note that only the /’s depend 
on z because only they refer to the non-natural (i.e. (9-dependent) embedding of the 
Lorentz group into the Poincare group. In contrast, the translation group Trans (M) 
is normal and hence has a natural place in the Poincare group. Correspondingly, the 
linear momenta are natural and do not depend on any arbitrary choices. Note that 
it immediately follows from (70b) that 



J[z + (9] = J[z\ - a A P 



(71) 



which is just the co-adjoint representation of translations stated in (54b). 

Remark 1 2 The discussion up to this point answers all the questions posed initially in 
connection with (70) in the case of Special Relativity. Globally conserved quantities 
(charges) in connection with Poincare symmetry are valued in the vector space dual 
to the Lie algebra and transform according to the co-adjoint representation under 
Poincare transformations of the fields to which these quantities belong. The splitting 
of the space in which the charges take their values into a “translational part” and 
a “homogeneous part” is not natural as far as the latter is concerned. Therefore 
the charges of the homogeneous (Lorentz-) part has an additional dependence on 
a spacetime point whose choice fixes the embedding of the Lorentz group into the 
Poincare group. The very notion of, say, angular momentum depends on the choice 
of this point. 



4 Supplementary Conditions and Mass Centres 

The z dependence of J may be used to put further more or less physically motivated 
conditions on /[z] to restrict the choices of z. Conditions of that sort are known as 
supplementary conditions whose aim is to narrow down the choices of z to a one- 
parameter family z( A) which is timelike and somehow interpreted as the worldline of 
the body. This line has many names depending on what supplementary conditions one 
uses. It can be “centre-of-mass”, “centre-of-inertia”, “centre-of-gravity”, “centre- 
of-spin”, “centre-of-motion”, “centroid”, etc. Early discussions of some of these 
concepts in Special Relativity were given in [8, 9]. For comprehensive discussions 
see [10] and in particular [2]. 
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If u e Vi := {i; g V \ r](u,u) = l}isa unit timelike vector characterising 
an inertial frame of reference, we may, e.g., consider the supplementary condition 
(recall that a dot indicates a contraction using the Minkowski metric) 



J[z a] ' u = 0 4^ J[z] ' u — (P ' u) a (a ' u) P = 0. 



This is equivalent to a 



linear inhomogeneous equation for a 
J[z] • u 



U(a) = 



P • u 



where 



n =id- 



P • u 



(72) 



(73a) 



(73b) 



is the projector onto := {v e V \ v • u = 0} parallel to P (caution: not parallel 
to u). Hence the solution space is one-dimensional timelike line in V parallel to P: 

a(z,u;X) = -^ \-XP, A g M. (74) 

P • u 

Its dependence on z immediately follows from (74) and (71): 

a(z + b, u; A) = a[z, u; X-\- (u • b)/{u • P)) — b. (75) 

Equation (74) is a timelike line in V that represents the worldline of the centre -of - 
mass in M relative to the origin z. The wordline in M clearly does not depend on z 
(up to reparametrisation) and is simply given by 

7 (w; A) = z + a{z, u; A). (76) 



Definition 13 The curve A i-^ 7 ( 1 /; A) is called the centre -of -mass wordline relative 
to the inertial observer w. □ 

The body’s angular momentum with respect to this centre-of-mass is 

S(u) := Jljiu; A)] := J[z + a(z, u; A)]. (77) 

The right-hand side clearly does not depend on A since shifting A moves a(z,u; X) 
in the direction of P according to (74) and hence leaves J unchanged according to 
(71). It then also follows immediately from (75) that the right-hand side of (77) does 
not depend on z. Hence, as indicated, S only depends on u. 



Definition 14 S(u) is called the body’s spin with respect to the inertial 
observers. □ 
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Except for its dependence on u , this definition meets standard Newtonian intuition. 
Indeed, according to this intuition we would call 

L(z, u) := u\ X) A P (78) 

the orbital angular momentum relative to z and u (there is again no A-dependence 
due io P aP = 0). Equation (71) then just tells us that the total angular momentum 
is the sum of the spin and orbital parts: 

J[z\ = L[z + a{z, u; A)] + a{z, u; X) A P = S(u) + L(z, u). (79) 



As in Newtonian mechanics, the z-dependence of angular momentum resides exclu- 
sively in the orbital part. But S and L each also depend on u, though in such a way 
that their sum is independent of u. This gives rise to the following 

Remark 15 Unlike in Newtonian Mechanics, the splitting of the total angular 
momentum into a spin (z -independent) and an orbital (z-dependent) part depends 
on the inertial frame, here represented by u. □ 



Einally, using the expression (74) for a, we get the following expression for the 
spin part. 



S{u) := /[z] 



a{z, u; X) A P = u • 




(80a) 



which explicitly displays its w -dependence. Again note that the z-dependence of J 
(given by (72)) drops out due to the wedge product with P. The expression on the 
right-hand side of (80a) has a simple geometric interpretation, namely that of the 
(tensor-factor wise) projection of J[Z] parallel to P onto we may also write 



5(t/) = n(8)n(/[z]), 



(80b) 



where n is as in (73a). Note that application of n (g) n cancels the z-dependence 
of J and, in exchange, introduces a w -dependence. Erom both expressions (80) the 
defining Eq. (72) for the centre-of-mass. 



S(u) • u = —u • S(u) = 0 



( 81 ) 



follows trivially. In (75) we already stated the obvious dependence of the line A i-^ 
a(z, w; A) in V on z (which is just like in Newtonian physics). More interesting, and 
purely special-relativistic in nature, is its dependence on w. It is clear from (74) that 
any normal timelike vector u e in Eq. (74) yields a worldline A X) in 

M parallel to P. As u varies over the 3 -dimensional hyperbola Vi C V we obtain a 
bundle of straight lines (geodesics) in M parallel to P : 

^ = U U ^4- 

ueVi AeM 



( 82 ) 
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In that bundle a particular line 7 = 7 ^ is distinguished, namely that for which 
u oc P, i.e. 

u = u^:= P/\\P\\. (83) 

Here we use the notation \\P\\ := ^/\P~^~^. This is the only timelike direction the 
body determines by itself.^ 

Definition 16 The inertial frame for which u <x P is called the body’s rest frame 
and 

Mo := {u^ • P)/c (84) 

the body’s rest mass. The line A 1 -^ A), i.e. the centre-of-mass in the body’s 

rest frame, is called its centroid, or wordline of the centre -of- inertia [ 2 ]. □ 

Using (80) we can immediately write down the body’s spin relative to its rest 
frame, 

:= • (/[z] A ufj, (85) 

which is clearly independent of z. With respect to the body’s centroid, the bundle 
(82) of wordlines of mass-centres has a simple geometric description: 

Theorem 17 The intersection of the bundle B with the hyperplane 

a) := {x e M \ (x - z) ’ = a} (86) 



is a 2-disc in perpendicular to the axis of rotation and with radius is 



, 11^*11 11^*11 

Km = = • 

\\P\\ Moc 



(87) 



Definition 18 The radius (87) is called the M0ller radius, first defined in [11] and 
also discussed in, e.g., [1, 12]. It measures the degree to which different inertial 
observers disagree on the spatial location of the centre-of-mass perpendicular to 
the axis of rotation. Typical orders of magnitude for Mpller radii will be given 
below. □ 



Proof of Theorem 1 7 Note first that T^iu^, a) is the hyperplane with normal u^ oc P 
and timelike distance a from the point z. As we may choose any convenient z, we 
take it to lie on the centroid. The hyperplane through z is then 



E(w^, cr = 0) = {v G M I (v — z) • t/* = 0}. ( 88 ) 



^Here we assume that P is timelike, which essentially means that we assume the energy-momentum 
tensor to satisfy the condition of energy dominance. 
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Relative to that choice of z (on the centroid) all other mass centres have worldlines 



7(1/; A) := z + 



• li 

P • u 



+ XP, 



(89) 



with A parametrising the individual worldline and u G Vi the different mass-centres. 
Since R • = 0 the second and third term on the right-hand side are perpendicular, 

so that the wordline A) intersects E(w^, a = 0) at A = 0. Hence 



B n E(w=t=, cr = 0) 



Sjfi • u 




z + 


W G Vl 


P • u 





(90) 



The claim is that this is a 2-dimensional disc of radius (87) centred at z which lies in 
the plane perpendicular to the axis of rotation. To see this, we parametrise u by its 
boost-parameters relative to i.e., by its rapidity p G [0, 00) and spatial direction 
n G = 1, so that 

u = cosh(p) u^-\- sinh(p) n. (91) 



Then, assuming ||5'=,= || / 0, 



• u 

P • u 



II II ' fi 

In ^ 



tanh(p). 



(92) 



Note that n 1-^ maps into itself. Since it is a non-zero antisymmetric endo- 
morphism of the 3 -dimensional vector space ii necessarily has a one-dimensional 
kernel, which is the rotation axis (the common fixed-point set of the rotations gen- 
erated by the Lie-algebra element S^) and maps the plane perpendicular to that axis 
into itself. In fact, since we divided by US'* ||, the map in the plane perpendicular to 
the rotation axis is a rotation by 7t/2. Hence, as n runs over the unit 2-sphere in u-^ 
and tanh(p) over the intervall [0, 1), the image of the map u becomes the 

unit 2-disc in □ 

Remark 19 The condition = 0 makes effectively a tensor in the anti- 

symmetric tensor product of the 3 -dimensional space Since as well as its 
antisymmetric tensor product are 3 -dimensional, there exists an isomorphism relat- 
ing them. A preferred one is that of the 3 -dimensional Hodge duality map, ★, which 
is obtained from the full (4-dimensional) Hodge duality map, denoted by ★, by first 
applying ★ followed by left contraction with i.e., ★T \= - ^kT = ir(T A u^); 

compare (146) of Appendix “Exterior Products and Hodge-Duality”. In this way we 
can uniquely associate a spin vector with the spin-tensor as follows: 



:= —u^ • = — ★ (5* A u^), (93a) 

= —u^ • ★5* = — ★ (5* A u^). (93b) 



Equation (93a) can be seen as definition of and (93b) as its inverse relation. The 

latter can be obtained from taking the ★ of the first and using the fact that ★ o ★ is the 
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identity on antisymmetric tensors of odd degree in even dimensions and Lorentzian 
signature, which follows from combining formulae (140) and (145) of Appendix 
“Exterior Products and Hodge-Duality”. This gives 

★ 5* = -5'* A (94) 

Subsequent contraction with using = 0, yields (93b). In passing we also 

note that the component versions of (93) are 

S: = -\sabcdV‘‘''Sful, (95a) 

Sr = (95b) 

We note from (93b) that 

= i^(S^ A 5* A u^) = 0, (96) 

which means that lies in the intersection of with the kernel of S^. In other 
words, points along the axis of rotation. Finally we note that 

t?( 5*, i) = iiabSfsl = -\iiacVbdS‘^^S‘=‘^ = -ir/ ® t?(5*, 5*). (97) 

By the definition of the normalised inner product on antisymmetric tensors (i.e. 
dividing by 1/p! the p-fold tensor products of p on antisymmetric p-tensors) and 
setting 

115*11 :=7|(5*,5*} norm i (98) 



we have (recall H^*!! := S^)) 



11^*11 = II 5* II. (99) 

This justifies calling the Spin vector, which is associated to the (Lie-algebra 
valued) spin tensor . □ 

We end this section by justifying the terminology centre -of -mass. For this we recall 
that given an energy-momentum tensor T and a unit timelike direction u , then T{u,u) 
is the spatial energy-density in the rest frame of the inertial observer represented by 
u. More precisely, let us foliate the affine space M by affine hyperplanes 

a) := {x e M \ (x — z) ' u = a] (100) 

for some given u e and z g M. Each T>(u, a) is a spacelike hyperplane of 
Einstein- simultaneity in the inertial frame characterised by w. It is clearly also a 
Cauchy surface in Minkowski space. The 3 -form representing the spatial energy- 
density of T on E (m , a) is then 
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£(u, a) = T(u, u) (101) 

where is the measure 3-form on E(w, a) (the Hodge-dual to the 1-form := 
T]^(u) := T](u, ')). The first moment of this energy distribution with respect to z is 

m{z,u]a)\= / (x—z)£(u,a)j / £(u,a), ( 102 ) 

where we explicitly indicated all dependencies on z, w, and a and separated the latter 
by a semicolon to emphasise the special meaning of cr as “time-parameter” labelling 
the different leafs of the foliation orthogonal to u. The dependence on z is rather 
trivial: m{z -\- h,u\ cr) = m{z,u\ cr) — Z? so that the set of points 

7 (w; cr) = z + m(z, w; cr) (103) 



is independent of z. Moreover, from (102) it is obvious that cr) — z) ' u = cr 
so that 7 (w, cr) G E(w, cr). Note that the construction of the “first moment” refers to 
the affine structure of M. Given that T satisfies the weak energy-condition we have 
T(u,u) >0, so that 7 (w, cr) lies in the convex hull of supp(T) H E(w, cr). 

Now let us calculate the right-hand side of (102). The denominator is, in view of 
(70a), 

/ T(u,u) = u ' P (104) 

independent of cr because P is independent of the Cauchy surface the integral is taken 
over. The a\h component of the numerator can be transformed as follows (calling 
= dji and using component language) 



[ (X 
7S(w,cr) 



z)^£{u\ cr) = 



+ 



/ {x — z)^ T^^UbUc dji 
JT.{u,a) 

f 2{x — z)^^ T^^^UbUc dji 

/ (x — z)^ T^^UhUc dfi 
JT,(u,a) 



= J“‘’[z]Ub + (TP‘^ 



(105) 



where we used that (x — z)^Ua = cr for v G E(w, cr). In total we get 

/[z] • u P 

-f(u; a) = z + h - — cr, (106) 

P U P ’ u 

which, upon using the new parameter A := cr/(P-i/),just turns into (76) and 
(74). This justifies the term “centre-of-mass” in Definition 13, where “mass” is to 
be understood as proportional to energy. For a system of point particles this means 
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dynamical mass, not rest mass."^ We emphasise again that the essential use of the 
affine structure in this construction. In fact, the very notion of “first”, “second”, etc. 
“moments” of a distributions presuppose such a structure. 



5 Typical Mpller Radii 



The ambiguity expressed in (87) only exists for bodies with spin. The formula suggest 
that for elementary particles it may well be of the order of magnitude of other radii, but 
that for laboratory- size or astrophysical bodies it is likely to be completely negligible. 
Let us therefore compute a few examples. 

A spin-1/2 particle has Ill'll = hjl and thus 



h _ 1 _ 1 ^ 

2Mqc 47t Mqc 47t 



(107) 



where Ac is the particle’s Compton wavelength. If the particle is electrically charged 
it has a classical charge-radius /^classical determined by 



Stt^o ^classical 



= Moc\ 



Hence we have 



Kyi — /^classical ^ 137/?classical 



(108) 

(109) 



Lets look at the Proton: Its experimentally determined “proton radius” (CODATA 
2010) is 

= 0-87 ■ 10-15 m. (110) 

Its Compton wavelength is 

Aproton= 1.32- 10-15 m, (111) 



and its Mpller radius is 



D (Proton) 



= 1.05 -10-15111 

47T 




(Proton) 
charge ’ 



( 112 ) 



^This definition of centre-of-mass, using the first moment of the dynamical-mass distribution, 
corresponds to cases (c) (for arbitrary u) and (d) (for w = w^,) in [10]. See this reference for a brief 
historical account of other definitions, e.g., based on the first moment of the rest-mass distribution, 
and a discussion of their partly peculiar properties, like moving mass centres in the zero momentum 
frame. There is also the issue of the Poisson structure for the coordinates of mass-centres, linear 
momentum, and spin, which for the mass centres based on dynamical mass where first discussed in 
[9]. Again we refer to [2] for a comprehensive discussion. 
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In comparison, a homogeneous rigid body of mass M and Radius R, rigidly 
spinning at angular frequency uo, has spin angular-momentum equal to 

2 o 

S=-MR^uo (113) 

Hence the ratio of its Mpller radius to its geometric radius is 



Rm S 2 / Rlu\ 
~Y ~ McR ~ 5 \ V ) 



(114) 



which shows that this ratio is of the order of magnitude of the circumferential velocity 
in units of the velocity of light. Applying this to Earth and Moon (somewhat idealised) 
gives 



^^arth) ^ 
^^oon) ^ 



Note that Lunar Laser Ranging also locates the moon’s “position” within accuracy 
of centimeters. Hence the Mpller radius is not as ridiculously small as one might 
have anticipated it to be for astronomical bodies. In fact, lets take the fast spinning 
Pulsar PSR J1748-2446ad, whose frequency is 7 16 Hz corresponding to a period of 
1 .4 milliseconds, for which we get Rou jc ~ 0.24. Hence the ratio of its Mpller radius 
to its geometric radius is 



(x) 

\ ^ / Pulsar 



0 . 1 , 



(116) 



which is the typical ratio of relativistic effects for neutron stars. 



Appendices 

Exterior Products and Hodge-Duality 

Let y bearealn-dimensional vectorspace, y* itsdualspaceandr^y* = y*® • • -® 
y * its p-fold tensor product.^ jpy* carries a representation of Sp, the symmetric 

group (permutation group) of p objects, given by 

TTp : ^ End(r^y*), 7rp(c7)(ai ® • • • ® CTp) := ® • • • ® (117) 



^We follow standard tradition to define /om^, i.e. the antisymmetric tensor product on the dual 
vector space V* rather than on V. Clearly, all constructions that are to follow could likewise be 
made in terms if Vrather than V*. 
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and linear extension to sums of tensor products. On * we define the linear operator 

of antisymmetrisation by 



Alt;, ^ ^ sign(cr)7Tp, (118) 

^ (j&Sp 

where sign : 5;, ^ {1, — 1} = Z 2 is the sign-homomorphism. This linear operator 
is idempotent (i.e. a projection operator) and its image of under Alt;, is the 

subspace of totally antisymmetric tensor-products. We write 



Clearly 



We set 



TTp{TPV*) =: f\ y*. 






dimi A y* 1= •! for p<n, 
0 for p > n. 



Ay* :=0Ay*. 

p=0 



(119) 

( 120 ) 

( 121 ) 



Let a e j\P V* and (3 e /\‘^ V*, then we define their antisymmetric tensor product 



p+q 

^Altp+,(a ® /?) 6 A (122) 



One easily sees that 



aAf3 = i-iy^(3Aa. (123) 

Bilinear extension of A to all of /\ V* endows it with the structure of a real 2^- 
dimensional associative algebra, the so-called exterior algebra over .If a\, ap 

are in V*, we have 

ai A ■ ■ ■ A ap = ^ sign((T) aa(\) ® • • • (g) ao-(p), (124) 

(TESp 

as one easily shows from (122) and (123) using induction. 

If {0^, , 0^} is a basis of V*, a basis of /\^ V* is given by the following (p 

vectors 

{0^^ A’^-A0^p\l<ai<a2<-’<ap<n}. (125) 
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An expansion of a G /\^ V* in this basis is written as follows 

a=: jr ««,■■■«, A- (126) 

using standard summation convention and where the coefficients aa^...ap are totally 
antisymmetric in all indices. On the level of coefficients, (122) reads 

{OL A (3)ai...ap^q = ^^^^^^[av-apPap^v-ap+qh (127) 

where square brackets denote total antisymmetrisation in all indices enclosed: 

1 ^ 

^[av-ap] •= — Slgn(cr) ^^cr(l) ■■■^cr(p) ■ (128) 

(J^Sp 

Suppose there is an inner product (non-degenerate symmetric bilinear form) r] on 
V and the associated dual inner product r]~^ on V* (compare Remarks). The latter 
extends to an inner product on each V * by 

p 

{a\ ® ® Up, !3i ® ® !3p) := G /?a) (129) 

a=\ 

and bilinear extension: 

® ® 0“'’ , !3in...bp ® ® e'>o) = (130) 

In particular, it extends to each subspace /\^V* C We have 

p 

{ai A ■ ■ ■ A ap , Pi A ■ ■ ■ A Pp) := p\ sign(cr) P[ rjiaa, Pa(ap (131) 

aeSp a=l 

and hence 

A---Ae“P , jiPbi-b,0‘’^ A ■■■ A 9‘’p) = aa^...a,P‘'''"“'’ ■ d32) 

In the totally antisymmetric case it is more convenient to renormalise this product 
in a p -dependent fashion. One sets 

(■ ’ ')norml 1^* ^ ')\ /\P V* (133) 

SO that 

{jjaa,...a,O^^A---Ae“P , * A ■ ■ ■ (134) 

\F- A F- t' /norm 
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Given a choice o of an orientation of V* (e.g. induced by an orientation of V), 
there is a unique top-form 5 G /\^ V * (i.e. a volume form for V), associated with the 
triple (y*, 77 “^, 6 >), given by 



A... (135) 

where . . . , is any 77 “^ -orthonormal Basis of V* in the orientation class o. 
The Hodge-Duality map at level 0 < p < ^ is a linear isomorphism 



p n-p 

*p: f\v*^ /\ y*, (136a) 

defined implicitly by 

a A = e{a, /^)norm. (136b) 



This means that the image of P e /\^ V* under in /\^~^ V* is defined by the 
requirement that (136b) holds true for all ct G /\^ V*. Linearity is immediate and 
uniqueness of ★p follows from the fact that if A G /\^~^ V* and ct A A = 0 for all 
a G !\^ V, then A = 0. To show existence it is sufficient to define on basis 
vectors. Since (136b) is also linear in a it is sufficient to verify (136b) if a runs 
through all basis vectors. 

From now on we shall follow standard practice and drop the subscript p on ★, 
supposing that this will not cause confusion. 

Let {e\, . . .en} be a basis of V and {0^ , ,0^} its dual basis of V*; i.e. 0^{eb) = 
6^. Let further {Oi, ..., On} be the basis of V* given by the image of {e\, . . .Cn) 
under 77 ; (compare Remark 3), i.e. Oa = • Then, on the basis {Oai ^ - ^Oap \ 

I < a\ < 02 < ' ' ' < Op < n} of /\^ y* the map ★ has the simple form 

A ■ ■ ■ A A . ■ . A (137) 

This is proven by merely checking (136b) for a = 0^^ A • • • A O^p and P = 
Obi ^ ^ ^bp- Instead of (137) we can write 



A • • • A 0‘^r’) = Sbi-bpbp+i-bn A---A0’’ 



(n-p) 

1 a\---a.p 



{n-p)\ 



Clp-\-\ •••dfi 



0ap+l A • • • A 



(138) 



which makes explicit the dependence on e and rj. 

If a = jjaai-.-apO^^ A---A0“p, then *a = A ■ ■ ■ A 

where 

(139) 



/ \ 1 Ctl'-'Clp 

K^Opbi-’-bn-p = yi^av-dp^ b^... 



b\-"bn-p' 



This gives the familiar expression of Hodge-Duality in component language. Note 
that on component level the first (rather than last) p indices are contracted. 
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Applying ★ twice (i.e. actually *(n-p) °*p) leads to the following self-map of 
A--- a6>“0) 

= A... A 0”^ (140) 

= ^~plCp)! e”^ A- ■■ A 0”^ 

= (^-iy(n-p) A...A0"'’. 

Note that 

(s, e)norm = ' ' ' i"’'’’" £ai-an£bi-b„ = (£i 2 -n)^/ det{r?(eo , eb)}. (141) 

This formula holds for any volume form 5 in the definition (136b), independent of 
whether or not it is related to 77 . 

Since the right-hand side of (136b) is symmetric under the exchange a ^ P, so 
must be the left-hand side. Using (140) we get 

{a, /^)norm e = aAirP = pAira = ira A P 

= ^>norm * a A ★ ★ /? = (£, £)“q 5^ (★« , */3>norm £, (142) 

hence 

(★Q( , ★/3)norm = ^)norm (<^9 P)norm- (143) 

From this and (140) it follows for a e /\^ V* and P g /\^~^ V*, that 

(Q(, ★/3)norm = ^)norm('^^ ’ ^*P)novm — (~1)^^ /^) norm- (144) 

This shows that the adjoint map of ★ relative to (• , Onorm is (— 1 )^^'^“^^ ★. 

Formulae (140), (142)-(144) are valid for general 5 in the definition (136b). If 
we chose 5 in the way we did, namely as the unique volume form that assigns unit 
volume to an oriented orthonormal frame, as does (135), then we have 

(^,^)norm = (-l)'^- (145) 

where is the maximal dimension of subspaces in V restricted to which 77 is 
negative definite; i.e. 77 is of signature (^+, ri-). Equation (143) then shows that ★ is 
an isometry for even ri- and an anti-isometry for odd n- (as for Lorentzian r] in any 
dimension). 

Finally we note the following useful formula: If i; G V let V * ^ tp~^V* 

the map which inserts v into the first tensor factor. It restricts to a map iy : /\^ V * ^ 
yyp-i y* ae f\P V*, we have 
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a = ★(ct A i;^). (146) 

where := r]^{v) (compare Remarks). It suffices to prove this for basis elements 
V = 6a oiV and a = 0^^ A ••• A O^p of /\^ V*, which is almost immediate using 
(138). 

Group Actions on Manifolds 

Let G be a group and M a set. An action of G of M is a map 

O : G X M ^ M (147) 

such that, for all m G M and ^ G G the neutral element, 

0(^, m) = m, (148) 

and where, in addition, one of the following two conditions hold: 

0(^, 0(/z,m)) = ^(gh,m), (149a) 

0(/z,m)) = ^(hg,m). (149b) 

If (147), (148) and (149a) hold we speak of a left action. A right action satisfies 
(147), (148) and (149b). For a left action we also write 

0(^,m)=:^-m (150a) 

and for a right action 

<^{g, m) =\ m • g. (150b) 

Equation (149) then simply become (group multiplication is denoted by juxtaposition 
without a dot) 



g • (h • m) = (gh) • m, (151a) 

(m • h) ’ g = m ’ (hg). (151b) 

Holding either of the two arguments of O fixed we obtain the families of maps 

: M ^ M 



m \-^ ^{g, m) 



(152) 
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for each g e G,ov 



M 

g\-^^(g,m) (153) 

for each m e M. Note that (148) and (149) imply that O^-i = (0^)“^ Hence each 
is a bijection of M. The set of bijections of M will be denoted by Bij(M). It is 
naturally a group with group multiplication being given by composition of maps and 
the neutral element being given by the identity map. Conditions (148) and (149a) 
are then equivalent to the statement that the map G Bij(M), given by ^ is 

a group homomorphism. Likewise, (148) and (149b) is equivalent to the statement 
that this map is a group anti-homomorphism. 

The following terminology is standard: The set Stab(m) := {g e G \ ^(g, m) = 
m} C G is called the stabiliser of m. It is easily proven to be a normal subgroup of G 
satisfying Stab(^ • m) = g(Sid^)(m))g~^ for left and Stab(m • g) = g~^ (Stab(m))^ 
for right actions. The orbit of G through m e M is the set Orb(m) := {^(g, m) \ 
g G G} =: 0(G,m) (also written G • m for left and m • G for right action). It is easy 
to see that two orbits are either disjoint or identical. Hence the orbits partition M. A 
point m G M is called di fixed point of the action O iff Stab(m) = G. An action O is 
called effective iff ^(g, m) = m for all m G M implies g = e; i.e., “only the group 
identity moves nothing”. Alternatively, we may say that effectiveness is equivalent 
to the map G i-^ Bij(M), g \-> being injective; i.e., = id^ implies g = e. 

The action O is cdli^dfree iff ^(g, m) = m for some m e M implies g = e; i.e., “no 
g e fixes a point”. This is equivalent to the injectivity of all maps : G ^ M, 
g \-> ^(g,m), which can be expressed by saying that all orbits of G in M are faithful 
images of G. 

Here we are interested in smooth actions. For this we need to assume that G is a 
Lie group, that M a differentiable manifold, and that the map (147) is smooth. We 
denote by exp : T^G ^ G the exponential map. For each X e TeG there is a vector 
field on M, given by 



V^(m) = — 0(exp(rX),m) 
dt t=o 

= (154) 

Here denotes the differential of the map evaluated at ^ g G. is also 
called the fundamental vector field on M associated to the action O of G and to 
X G TeG. (We will later write Lie(G) for T^G, after we have discussed which Lie 
structure on TeG we choose.) 

In passing we note that from (154) it already follows that the flow map of is 
given by 

¥\p{m) = <I>(exp(fX),m). 



(155) 



Energy-Momentum Tensors and Motion in Special Relativity 



157 



This follows from exp(sX) exp(tX) = exp[(s -\-t)X) and (149) (any of them), which 
imply 



on the domain of M where all three maps appearing in (156) are defined. Uniqueness 
of flow maps for vector fields then suffice to show that (155) is indeed the flow of V ^ . 

Before we continue with the general case, we have a closer look at the special cases 
where M = G and O is either the left translation of G on G, O (g, h) = Lg{h) := gh, 
or the right translation, ^{g,h) = Rg(h) := hg. The corresponding fundamental 
vector fields (154) are denoted by and respectively: 



The seemingly paradoxical labeling of R for left and L for right translation finds its 
explanation in the fact that is right and is left invariant, i.e., Rg^^V^ = V/ 
and Lg^V^ = V^. Recall that the latter two equations are shorthands for 




(156) 




(157b) 



(157a) 



Rg*hV^(h) = V^ihg), 

Lg*hV^(h) = Vligh). 



(158a) 

(158b) 



The proofs of (158a) only uses (157a) and the chain rule: 




(159a) 



d 

dt t=o 




= V^ihg). 



Similarly, the proof of (158b) starts from (157b): 




d 

dt 




(159b) 



= vligh). 
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In particular, we have 



Viig) = Rg*eVl{e) = Rg^eX, (160a) 

yf (fl) = Lg«y#(e) = Lg^eX, (160b) 

showing that the vector spaces of right/left invariant vector fields on G are isomorphic 
to TeG. Moreover, the vector spaces of right/left invariant vector fields on G are 
Lie algebras, the Lie product being their ordinary commutator (as vector fields). 
This is true because the operation of commuting vector fields commutes with push- 
forward maps of diffeomorphisms: W] = This implies that the 

commutator of right/left invariant vector fields is again right/left invariant. Hence the 
isomorphisms can be used to turn TeG into a Lie algebra, identifying it either with 
the Lie algebra of right- or left-invariant vector fields. The standard convention is to 
choose the latter. Hence, for any X,Y e Lie(G), one defines 



[X,Y]:=[V^,vl]{e). (161) 

TeG endowed with that structure is called Lie(G). Clearly, this turns Vl : Lie(G) ^ 
Vec(G), X \-> into a Lie homomorphism: 

= (162) 

As a consequence, Vr : Lie(G) ^ Vec(G), X \-> now turns out to be an anti 
Lie isomorphism, i.e., to contain an extra minus sign: 

y/'^l ;=-[y|, yj], (163) 

This can be proven directly but will also follow from the more general considerations 
below. 

On G consider the map 



C : G X G ^ G 

(h, g) hgh~K (164) 

For fixed h this map, Ch '■ G ^ G, g Chig) = hgh~^, is an automorphism (i.e., 
self-isomorphism) of G. Automorphisms of G form a group (multiplication being 
composition of maps) which we denote by Aut(G). It is immediate that the map 
C ^ Aut(G), /z C/i, is a homomorphism of groups; i.e.. 



Ce = idc , 
ChoCk = Chk- 



(165a) 

(165b) 
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Taking the differential at ^ G G of we obtain a linear self-map of T^G, which 
we call Adh : 

Adh := : T^G T^G. (166a) 

Differentiating both sides of both Eq. (165) at ^ G G, using the chain rule together 
with Ck {e) = e for the second, we infer that 

Ade = idj;G, (166b) 

AdhoAdk = Ad^. (166c) 



This implies, firstly, that each linear map (166a) is invertible, i.e. an element of the 
general linear group GL(r^G) of the vector space T^G, and, secondly, that the map 

Ad : G ^ GL(T;G) 

h ^ Adh (167) 



is a group homomorphism. In other words. Ad is a linear representation of G on 
r^G, called the adjoint representation. 

In (158) we saw that and are invariant under the action of right and left 
translations respectively (hence their names). But what happens if we act on with 

left and on with right translations? The answer is obtained from straightforward 
computation. In the first case we get: 



Lg^h[VR{h)) 






dt 






d 

dt 

d 

dt 



exp(tX)h^ 

gh^ 



,.Adg(X) 

Va igh). 



(168a) 



where we used (166) in the last and the definition of in the first and last step. 
Similarly, in the second case we have 



Rg*h{Vl(h)) 






— I (hexp(tX)h) 
dt h=0 V / 

^ ' (h exp(rZ) g^ 

(hgCg-i{exp{tX))'j 



dt 

d 

dt 



t=o^ 



t=o 



Adg-l(X) 



(168b) 
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Taking the differential of Ad at ^ G G we obtain a linear map from TeG into 
End(r^G), the linear space of endomorphisms of TgG (linear self-maps of TeG). 



ad := Ad^e • TeG End(r^G) 
X \-> adx- 



(169) 



Now, we have 



adx(T) = [X, F] 



(170) 



where the right-hand side is defined in (161). The proof of (170) starts from the 
fact that the commutator of two vector fields can be expressed in terms of the Lie 
derivative of the second with respect the first vector field in the commutator, and the 
definition of the Lie derivative. We recall from (155) that the flow of the left invariant 

vector fields is given by right translation: El^ ^ (g) = g exp(rX). Then we have 



[X, Y] = [Vl, vlm 

= (LyxVl)(e) 

d 



dt 

d 

dt 

d 

dt 

d 

dt 



y? 

PI ^ 

Ei; 



(y/(Fiy\e))) 



d 



Flv 






t=0 ^ 

d 

Qxp(tX) exp(sY) Qxp(-tX) 



t=o ds 



5=0 



t=o 



Adexp(tX)(F) 



= adx(F). 



(171a) 



yX 

A completely analogous consideration, now using El^ ^ (g) = exp(rX) g, allows to 
compute the commutator of the right-invariant vector fields evaluated at ^ G G: 



[V#, yj](e) = (LyxVl)ie) 



d 

dt 

d 

dt 

d 

dt 

d 

dt 






Fir^.^i 



Fir^^ 



t=0 ^-•'>*ds\s=0 ' 

d 

exp(— rX) exp(^F) exp(r5') 



=ods 



s=0 



t=o 



Adexp(-rX)(F) 



= -adx(F) 
= -[A, Y], 



(171b) 
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Equation (163) now follows if we act on both sides of [V^ , V^](e) = —[X, Y] with 
Rg^e and use (158a). 

We now return to the general case where M is any manifold and the vector field 
is defined by an action O as in (154) and whose flow map is given by (155). 
Now, given that O is a right action, we obtain 



[V^, V^](m) 

= {LyxV^){m) 



d 

dt 

d 

dt 

d 

dt 

d 

dt 



%.(v' mr («»)) 



t=0 ^ 



piE 

t=o 

d 

t=Q ds 



Fi: 






5=0 



0(exp(rX) exp(sY) Qxp(-tX), m) 



t=o 



?(Adexp(?X)(f")) 






(172a) 



where we used (155) and (149b) at the fourth and (170) at the last equality. Similarly, 
if O is a left action, we have 



= (LyxV^)(m) 

d 



dt 

d 

dt 

d 

dt 

d 

dt 



d 



t=Q 



FI 









d 

t=o ds 



5=0 



0(exp(— rX) exp(5'7) exp(rX), m) 



t=o 



(Adexp(— tX){Y)) 






(172b) 



where we used (155) and (149a) at the fourth and again (170) at the last equality. 

Finally we derive the analog of (168) in the general case. This corresponds to 
computing the push-forward of under If O is a left action we will obtain the 
analog of (168a), and the analog of (168b) if O is a right action. For easier readability 
we shall also make use of the notation (150). For a left action we then get 
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dt 

d 

dt 



0(^f Qxp(tX), m) 
^{Cg{cx^{tX)),g ■ m) 



^(g-m)*e - 



dt 



t=0 



Cg{exp(tX)) 



= ■ m) 



Similarly, if <I> is a right action, 



^g*ni{v^(m)) = ^g*m— ^_^<l>(exp(rZ), m) 



dt 

d 

dt 



t=0 



0(exp(rX) g, m) 
0(C^-i(exp(rX)),m • g) 



— ^{m-g)^e . 



dt 



t=o 



Cg-i (exp(^X)) 



— ^(m-p)*6 (^)) 

t • 

(0(^, m)). 



= y 9 \m • g) 



= yAd^_im 



(173a) 



(173b) 
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Solutions of Mathisson-Papapetrou 
Equations for Highly Relativistic 
Spinning Particles 



Roman Plyatsko, Mykola Fenyk and Oleksandr Stefanyshyn 



Abstract Different types of essentially nongeodesic motions of highly relativistic 
spinning particles in Schwarzschild’s and Kerr’s background which follows from 
the Mathisson-Papapetrou (MP) equations are considered. It is shown that depen- 
dently on the correlation of signs of the spin and the particle’s orbital velocity the 
spin-gravity coupling acts as a significant repulsive or attractive force. Numerical 
estimates for electrons, protons, and neutrinos in the gravitational field of black holes 
are presented. The correspondence between the general relativistic Dirac equation 
and MP equations is discussed. It is stressed that for the highly relativistic motions 
the adequate supplementary condition for the MP equations is the Mathisson-Pirani 
condition. In the following it is important to study the possible role of the highly rel- 
ativistic spin-gravity coupling in astrophysics, cosmology, and high energy physics. 



1 Introduction 

After obtaining the general relativistic equations of motion for a test body with inner 
rotation/nonquantum spinning particle in [1, 2], investigations of their solutions for 
some backgrounds were started in [3-7]. Schwarzschild’s metric was taken into 
account in [3, 5, 6], Lense-Thirring’s and Melvin’s metric were under consideration 
in [4, 7] correspondingly. The concrete physical effects of spin-gravity coupling on 
rotating body trajectories have been studied. For example, it was shown that the cor- 
rection in perihelion motion of the planet Mercury because of its spin is negligible 
[3]. Similar results were obtained for other cases of spinning particle motions in 
the gravitational field which were considered at that time. Therefore, it is naturally 
that the corresponding statement was formulated in the known book [8] from 1973. 
However, the same year is dated paper [9] where another supposition one can read: 
“The simple act of endowing a black hole with angular momentum has led to an 
unexpected richness of possible physical phenomena. It seems appropriate to ask 
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whether endowing the test body with intrinsic spin might not also lead to surprises.” 
This paper, together with [10], where the spin- spin and spin-orbit gravitational inter- 
actions were considered, gave the impulse for realizing our program of more detailed 
investigations of physical effects following from the MP equations. We began from 
searching possible new analytical solutions of the exact Mathisson-Papapetrou equa- 
tions in Schwarzschild’s background without the supposition that they must be close 
to the corresponding solutions of the geodesic equations. That is, in our calculations 
we did not use the approach of small corrections because of the spin to the known 
geodesic solutions. 

It is known that the exact MP equations have a simple analytical solution for the 
radial motion with any oriented spin in Schwarzschild’s background: this solution 
describe the particle’s world line which coincides with the corresponding geodesic 
world line. Other solutions of the MP equations in Schwarzschild’s background, 
which describe the circular orbits of a particle with spin orthogonal to the equatorial 
plane of its motion, is known as well. These solutions differ from the corresponding 
geodesic solutions because of the terms which describe the gravitational spin-orbit 
interaction. At the same time, it is not difficult to check that the MP equations do not 
admit solutions in Schwarzschild’s equatorial plane with the spin non-orthogonal to 
this plane. However, paper [10] inspired the question: do the MP equations admit any 
solution which describe circular orbits of a spinning particle beyond the equatorial 
plane of Schwarzschild’s background? Indeed, it was shown in [10] that according to 
the MP equations the spin-spin interaction cannot maintain a spinning test particle at 
rest on the axis of rotation of Kerr’s mass. Nevertheless, this fact does not exclude a 
priori an effect of particle hovering above Kerr’s or Schwarzschild’s mass due to the 
spin-orbit interaction, i.e., for some dynamical, nor statical, cases. Then it was shown 
that the MP equations admit the corresponding solutions both in the Schwarzschild 
[11, 12] and Kerr backgrounds [13] which describe just the circular non-equatorial 
orbits. It is important that for realizing these orbits a spinning particle must posses 
the highly relativistic velocity relative to a source of the gravitational field. Late those 
and other results of studying the highly relativistic spinning particle motions were 
summarized in book [14]. Further development of the corresponding direction of 
investigations for the last 15 years is presented in [15-23]. 

The important point in the description of spinning particle motions by the MP 
equations is the choice of an appropriate supplementary condition. From the first 
steps in our investigations we chose the Mathisson-Pirani supplementary condition 
[1, 24] as a basic one. In this choosing we took into account the results of papers 
[25, 26]: if we need to describe, in the proper sense, just the inner rotation of the 
body, it is necessary to use the Mathisson-Pirani condition. Nevertheless, in many 
cases, when the body/particle velocity is not very close to the velocity of light, this 
condition can be substitute by the Tulczyjew-Dixon condition with high accuracy 
(more on this subject we write below). 

Here we present our results of the theoretical investigations of the highly rela- 
tivistic spin-gravity coupling by analysis of the corresponding solutions of the MP 
equations in Schwarzschild’s and Kerr’s background. Naturally, from a practical 
point of view the situation with a macroscopic test particle moving relative to a 
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massive body with the velocity close to the velocity of light is not realistic. However, 
the highly relativistic velocities are usual in astrophysics for elementary particles. In 
this connection we stress the important fact: in many papers [27-35] it was shown 
that in a certain sense the MP equations follow from the quantum general relativistic 
Dirac equation [36-38] as a classical approximation. 



2 Different Representations of Mathisson-Papapetrou 
Equations 

Because the traditional form of the MP equations are presented above in other papers, 
in this section we write only their non-traditional forms which are convenient for 
further analysis of physical meaning of the corresponding solutions. 



2.1 MP Equations Through Spin 3 -Vector 



For description of the particle spin by the MP equations, in many papers both the 
tensor of spin and the 4-vector of spin s\ are used, where by definition 

•?A = , 5^^ = e^l^^'^UuSa, (1) 

where and g are the Levi-Civita symbol and the determinant of the metric tensor 

correspondingly (here, and in the following, Greek indices run 1, 2, 3, 4 and Latin 
indices 1, 2, 3; the signature of the metric + is chosen; units c = G = 1 

are used). That is, instead of the six components one can deal with the four 
components sx- In the term of s\ the Mathisson-Pirani condition is 

s\u^ = 0 , ( 2 ) 



and the spin part of the MP equations takes the form 



Ds^ 

ds 



Du^ 
^ ds 



( 3 ) 



i.e., according to the MP equations at the Mathisson-Pirani the 4- vector of spin is 
Fermi transported [8]. 

For further reduction of the number of the spin components which are presented 
explicitly in the MP equations, it is possible to do another step: we can operate 
only with the three components of the spin tensor which contain only the spatial 
indices, i.e., because the three other components of this tensor by the Mathisson- 
Pirani condition are expressed as 
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= (4) 

U4 

The useful form of the MP equation can be obtained if instead of the spin tensor 
components the 3 -component value St is introduced by definition 

Si = (5) 

2U4 

where em is the spatial Levi-Civita symbol; the expression of through Si is 












( 6 ) 



Then the transport part of the MP equations can be written as [14, 15] 



^ as 



[l5l + r!^pU^uP)(U4S[ngp^n + g4pS[pU„]) 



+ = 0, (7) 

where for the free indices m, n and p it is necessary to put the circle combinations 
1, 2, 3; 2, 3, 1; 3, 1, 2, and the fourth equation of the MP transport part can be 
transformed to the form 



miu"^ -h + 



d ■ 
ds - 



\g\-y^(u^^ + r’^^u'^uP)gpkUie‘'^‘Si 



+ ^\g\-^^^uHu4Rtik + 2uiRl,^^)s’'^‘Si = 0 ( 8 ) 

(a dot denotes differentiation with respect to the proper time 5', and square brackets 
denote antisymmetrization of indices). 

The all three independent equations of the spin part of the MP equations can be 
written as [14, 21] 

U4Si -h 2(U[4Ui] - u'^Upr^^^Ui])SkU^ 

+ = (9) 



In practical calculations for the concrete metric, both analytical and numerical, 
it is often more convenience to use the MP equation representation just through the 
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3-vector St . In particular, at a glance, from (9) we see that the spin part of the MP 
equations becomes much simpler if the condition SiU^ = 0 is satisfied, i.e., when the 
spin is orthogonal to the particle’s trajectory. 

According to (1) and (5), there is a simple relationship between Si and 

5 ,- = + ^^ 4 . ( 10 ) 

U4 



2.2 MP Equations in Comoving Tetrad Representation 

In many cases, it is important to describe a physical event in different frames of 
reference. Concerning the motion of a spinning test particle in the gravitational field, 
it is useful to consider the main properties of this motion from the point of view of an 
observer moving with the spinning particle, with respect to which the representative 
point of the particle is at rest. 

As usual, the comoving frame of reference are determined by a set of orthogonal 
tetrads where and 

= V{iy)ip)^ ^i^^V{7r)ip) (H) 

(here, and in the following, in contrast to the indices of the global coordinates, the 
local indices are placed in the parenthesis; r/(^)(p) is the Minkowski tensor). 

The local components of the spin 4-vector for a comoving observer satisfy the 
condition 5* ( 4 ) = 0 [26]. Concerning the spatial components of this vector, without 
loss in generality, for convince we direct the first vector along the direction of spin, 
that is we write 

‘^(l) / 0, ^^(2) = 0, 5'(3) = 0. S(4) = 0. (12) 

The MP equations have the constant of motion 5 q = where l^ol is the 

absolute value of the particle spin [26], and according to (12) \ = l^ol- 

By relationships (11) and (12) for the local components of the spin 3-vector we 
have 

^(1) = -^(1) / 0, 5(2) = 0, 5(3) = 0. (13) 

Then according to (10) the global components of this 3 -vector are connected with 
its local components by the relationship 

Af & = (AfA,!" - A™A«) ( 14 ) 



It follows from Eq. (9) that 



7 (*:)( 1 )( 4 ) = 0, 



(15) 
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where 7 (y^)(i)( 4 ) are the Ricci coefficients of rotation. Because these coefficients 
are antisymmetric in the first two indices, Eq. (15) contains the two relationships: 
7(2) ( 1 ) (4) = 0 and 7 ( 3 ) (i) ( 4 ) = 0. The similar third relationship, 7 ( 2 ) ( 3 ) ( 4 ) = 0, can be 
added if one need exclude the rotation of the second and the third local spatial vectors 
relative to the direction of the first vector, which in our consideration is connected 
with the spin orientation. Therefore, we are free to put 

l(i)(k)(A) = 0. (16) 

Equation (16) is the known condition of the Eermi transport for the local orthonormal 
vectors. 

By direct calculations, which are quite simple but rather lengthy, it follows from 
Eq. (7) that 

»I7(1)(4)(4) = 5'(1)7?(1)(4)(2)(3), (17) 

W7(2)(4)(4) = 5(1)(1?(2)(4)(2)(3) ~ 7(3)(4)(4) ~ 7(2) (4) (4) 7(2) (3) (4)), (18) 

'«7(3)(4)(4) = 5(1) (1? (3) (4) (2) (3) + 7(2)(4)(4) - 7(3)(4)(4)7(2)(3)(4))- (19) 

Here, as for the spin part of the MP equations, the first local spatial vector is deter- 
mined by the spin orientation. 

It is known that the value 7 ^( 4 ) ( 4 ) is the dynamical characteristic of the reference 
frame, namely, its acceleration. 

In the linear spin approximation instead of Eqs. (17)-(19) we have the more simple 
equations 

W7(0(4)(4) = -5(1)/?(1)(4)(2)(3)- (20) 

That is, according to (20) the force of the spin-gravity interaction, as estimated by 
the comoving observer in the linear spin approximation, is fully determined by the 
local components of the Riemann tensor. We shell analyze expression (20) in some 
concrete cases of spinning particle motions in the gravitational field. 



3 Highly Relativistic Solutions of MP Equations 
in Schwarzschild’s Background 

We begin this section from consideration of a physical situation which is not con- 
nected directly with the MP equations and, in a certain sense, is similar to the known 
one in electrodynamics, when the electromagnetic field of a moving electric charge 
is under consideration. Now we are interested in the gravitational field of a moving 
mass. More exactly, let us analyze the expressions for Riemann’s tensor components 
as evaluated by an observer which is moving with any velocity relative to Schwarz- 
schild’s mass. 
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3.1 Gravitational Field of a Moving Schwarzschild^s Mass 



As usual, to describe a moving observer we can use the corresponding set of orthog- 
onal tetrads . Following many papers where the deeper analogies between gravi- 
tation and electromagnetism are under investigations we consider the gravitoelectric 
and gravitomagnetic components of the gravitational field. For example, 
in [39] these components are determined as 



E 



(0 

ik) 



= R 



( 0 ( 4 ) 

(^)( 4 )’ 



( 21 ) 



B 



(0 _ 
ik) - 



1 p(0(4) _(m)(n) 

{m){nf {ky 



( 22 ) 



Here our consideration is restricted by the gravitomagnetic components only for the 
observer which is moving relative to Schwarzschild’s mass. The space local vectors 
can be oriented as: (1) The first vector is orthogonal to the plane that is determined 
by the instantaneous direction of the observer motion relative to the Schwarzschild 
mass and the radial direction; (2) The second vector is directed along the direction 
of the motion of the observer. Then using the standard Schwarzschild coordinates 
we obtain the nonzero components of the gravitomagnetric field 



B 



( 1 ) 

( 2 ) 



«(2) 

= B^^^ = 



r'^yj u\u^ — 1 




(23) 



B 



( 1 ) 

( 3 ) 



= B 



( 3 ) _ 
( 1 ) “ 



r^y/u^u^ — 1 




(24) 



where u\\ = dr jds and u_\_ = rdLpjds are the radial and tangential components of 
the observer’s 4- velocity, and M is Schwarzschild’s mass. 

Let us analyze expressions (23) and (24) at different velocities of the observer 
relative to Schwarzschild’s mass. Note that the gravitomagnetic components are 
nonzero only at 7 ^ 0. That is, here the situation is similar to the known one in the 
electrodynamic for the components of the magnetic field of a moving electric charge. 
Then it is easy to see that the components (23) and (24) significantly depend on the 
velocity of an observer relative to the Schwarzschild mass. Indeed, at \uj_\ 1, 

I w II I ^ 1, the common term 3M/ in expressions (23) and (24) is multiplied on the 
corresponding small values. Whereas in the highly relativistic case, for ^ 1, 
this term is multiplied by the large (as compare to 1 ) values and then we have 



B 



( 1 ) 

( 2 ) 



B 



( 2 ) 

( 1 ) 



3M 



B, 



( 1 ) 

( 3 ) 



B 



( 3 ) 

( 1 ) 



3M 



(25) 



where 7 is the relativistic Lorentz factor as calculated by the particle velocity relative 
to the Schwarzschild mass. 
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3.2 Highly Relativistic Spin-Orbit Acceleration as Measured 
by a Comoving Observer 



What physical effects are caused by the gravitomagnetic components (23)-(25)? To 
answer this question, let us consider Eq. (20) which can be written as 



Cl{i) 



5(1) 






( 0 ( 4 )( 2 )( 3 ), 



(26) 



where a^i) are the local components of the particle 3 -acceleration relative to geodesic 
free fall as measured by the comoving observer; S(\) is the single nonzero component 
of the particle spin, i.e., here the first local space vector is oriented along the spin. 

In the concrete case of particle motion in Schwarzschild’s background, when 
the particle’s spin is orthogonal to the plane determined by the direction of particle 
motion and the radial direction, by ( 22 ) and (26) we have 



^ - 5(1) p(i) 

^(0 - ^ ^( i ) ’ 



(27) 



where the nonzero values of come from (23) and (24). According to (25), the 
acceleration components (27) depend, in the case of highly relativistic nonradial 
motions, on 7 such that a( 2 ) ^ 7, a( 3 ) ^ 7^. The component remain equal to 
zero at any velocity. The absolute value of the 3 -acceleration is proportional to 7 ^. 

So, according to the MP equations, from the point of view of the observer comov- 
ing with a spinning particle in Schwarzschild’s background, the spin-gravity inter- 
action is much greater at the highly relativistic particle’s velocity than at the low 
velocity. This interaction has the clear feature of the spin-orbit interaction. However, 
it is interesting to estimate the effects of this interaction for another observer, e.g., 
which is at rest relative to the Schwarzschild mass. 



3.3 Exact MP Equations in Schwarzschild^s Background 
After Using Constants of Motion 

Due to the symmetry of Schwarzschild’s and Kerr’s metric here the MP equations 
have the constants of motion: the particle’s energy E and the projection of its angular 
momentum [5, 40-42]. It is known that in the case of the geodesic equations the 
analogous constants of motion were effectively used for analyzing possible orbits of a 
spinless particle in these metrics [43] . Namely, by the constants of energy and angular 
momentum the standard form of the geodesic equations, which are the differential 
equations of the second order by the coordinates, can be reduced to the differential 
equations of the first order. Naturally, it is interesting to apply the similar procedure 
to the exact MP equations. However, in contrast to the geodesic equations, the exact 
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MP equations at the Mathisson-Pirani condition contain the third derivatives of the 
coordinates. 

The corresponding explicit form of the MP equations in Kerr’s background is 
presented in [21]. Here we write these equations for Schwarzschild’s background. 
As in [21], we use the set of the 1 1 dimensionless quantities ji where by definition 

j 5 = M*, j 6 = Mu^, yi = Mu^ , 

Sx S2 

^9 = — 3^10 = Jll = 

mM mM^ 

In addition, we introduce the dimensionless quantities connected with the particle’s 
proper time 5 ' and the constants of motion E, J^ : 



M 






m 



S3 



(28) 



S ^ E ^ Jy 

X = — , E = —, J = . 

M m mM 

Some quantities from (28) satisfy the four simple equations 

yi = ys, h = ye, h = yi, n = j's, 



(29) 



(30) 



here and in the following a dot denotes the usual derivative with respect to x. Other 
seven equations for the 1 1 functions y/ can be written as 



where 



h = ye = M, >^7 = ^ 3 , = ^ 4 , 

yq = A5, yio = A6, yii = A7, 



(’ 



Ai = A + 1 yxyl + yiy^ sh?- y2 - ^ U + 

* yi J yfq 



A 2 = B J 5 y 6 + J 7 sin j 2 cos j 2 , 

y\ 



q *(-)’ 7 )’iosin ^)’2 



A3 = (^-q hsyio + )’i yew) X 

+y6yn)A + + y\yiy9 sin^ w) 



B 



(31) 

(32) 
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E 1 2 

-ys sm y2 H sin y2 + iyiyio sm y2 

S{)q y\q 



1-2 2 

yeyn) sm y 2 ysji ~ '^yeyi cot y 2 , 

-I 



M = j8 i^-q ^ysyio + yh 6 y 9 ^ 



^\_-q ^yioA + ylygB - yf 



+ - 



so sm y 2 yi 



, yio 

H y9 cot y2 



2 yy . 

sm y2 

ym 



yf 






^ (^“'wyio - j’D’ewj 

[[yyys? (-)’5)’io)’r^ + qy9{y6yw + yiyn)) 

{qy9 + ysCysw + >’6)’io))] 



y?,ynq 
y] Sin^ )>2 



[ 2 E 1 2 1 

: Ji smj2 + — yCyyJiosm yy-jejii) 

L qe^y% qy^ J 



, ^J 8 r 2 , >' 5 >’ll 2 , , xl 

H — 2 “ ^'syio + qy\ y9(j6yio + yiyn) 

yf L sm Vo -I 



sm"' y2 



( 






+ 



qy^yiQ) 



50 sm y 2 y\ 



- y9 cot y2 



) 



— {y 9 + q VsCysjQ + yejio + yvjii)] 

Sq sm y 2 

v 6 y 9 2 

X ^?(->’ 7 )’iosin y2 + y6yn) 

L y\ 

-yiiC^^Vsjio - yfyew)”*], 

5 [j9 + ^“*j5(j5y9 + yeym + jvyn)] 
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q ^{yiQ + y\y6{y5y9 + y6y\{) + yiyii))A 



H y\yi siny2 + q ^yiiyiyio sin^ }^2 - yeyn) 



qeo 



-y^ 



( — ^ h qymoyi ^ - qysy9 cot ^2) 

V5osmy2 ^ 



. 3^63^10 + 3^73^11 , , , , 

^5 = qys qys(y5y9 + 3^63^10 + 3^73^11) 

3^1 



\(a yl \ ^-1 

LV >'8 ^yj 



j| \ _i 2 2-2 1 ^8 

2 U - yiye - yiyj sm y2 + -2 

ys qyf J yf 



^6 = -liyeysygiyi - 3) + 



gysysyio 

yi 



+ <iy 7 y 8 yii cot>'2 - gysiysyg + y^yw + yiyn) 

yh 

3^8 



2 

x[y?A2- ^^A4 + 2y5ye(y\ cos j2sin J2], 



. qysysyu , . 2 

Ay = qyiyiy^iyi - 3) sm y 2 

yi 



+ ^yeysyii cot j'2 - qyiy^yw cos >'2 sin y2 

r 2 2 3^?3^7 2 

■^3^8(3^5J9 + 3^63^10 + 3^73^11) X yi A3 sm y2 ^48111 y2 

L 3^8 

+ 2ys3'7(3'i -^“^)sin^y2 + 2y^y63'7COsy2siny2]. 



In all expressions (33) we note 



and 



q = 1 



3^1 



50 = 



l^ol 

mM 



(33) 



(34) 



We stress that while dealing with the MP equations in Schwarzschild’s or Kerr’s 
background the condition for a spinning test particle 
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(35) 



mr 



must be taken into account [10]. 

3.4 Highly Relativistic Circular Equatorial Orbits 
of a Spinning Particle 

Equations (31) and (32) can be used for description of all possible types of the spin- 
ning particle motions in Schwarzschild’s background according to the MP equations 
under the Mathisson-Pirani condition. We begin from considering the important par- 
tial solutions which describe the circular orbits of a spinning particle in the equatorial 
plane 6> = tt/ 2 of Schwarzschild’s metric when the spin is orthogonal to this plane. 
Then all the MP equations are reduced to the single algebraic equation, where among 
the values ji only yi, yv, and yg are present [22]: 



(without any loss in generality, this equation is written for the orientation of the 
particle’s spin when S 2 = Sq >0). Per relationship = 1 the value yg can be 
expressed through yv : 



In the limiting transition to the spinless particle (^o = 0) we have from (36) the result 
known from the geodesic equations. 

Equations (36) with (37) determine the region of existence of the circular orbits and 
the dependence of the particle’s angular velocity, which in notation (28) correspond to 
yv, on the radial coordinate. Naturally, it is interesting to compare the situations with 
the spinless and spinning particle. The known result is that by the geodesic equations 
these orbits are allowed only for r > 3M and the highly relativistic circular orbits 
exist only for r = 3M(l S), where 0 < S 1. In this context we point out 
that Eq. (36) admits the highly relativistic circular orbit of a spinning particle with 
r = 3M, i.e., yi = 3. Indeed, it is easy to calculate the single real root yv of Eq. (36) 
in the simple analytical form 



jyCji - - yiiy^ - 3)jf 



+ yi{2y\ - 3)eo)’8 + )’i = 0 



(36) 




(37) 



3 - 3/4 



yi = ^(1 + O(eo)) 



(38) 
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(here we take into account that by (35) it is necessary ^ 1). Then the absolute 
value of the particle orbital 4- velocity, which corresponds to (38), in the main spin 
approximation is equal to The same value has the relativistic Lorentz 

7 -factor of the spinning particle as calculated by an observer which is at rest relative 
to Schwarzschild’s mass. Hence 7 ^ ^ 1, i.e., for the motion on the circular orbit 
with r = 3M the velocity of a spinning particle must be ultrarelativistic, the more 
ultrarelativistic the smaller is the ratio of spin to mass of the particle. 

It is known that by the geodesic equations a spinless particle with nonzero mass 
of any velocity close to the velocity of light, starting in the tangential direction from 
the position r = 3M, will fall on the horizon surface within a finite proper time. 
On the other hand, a spinning particle can remain indefinitely on the circular orbit 
with r = 3M due to the highly relativistic spin-gravity coupling if condition (38) 
is satisfied. It means that in this concrete situation the spin-gravity coupling acts as 
a strong repulsive force which compensates the usual (geodesic) attraction. By the 
way, from the point of view of an observer comoving with the spinning particle the 
value of this force is the same order as the formally calculated value of the Newtonian 
gravitational force for the masses M and m at distance r = 3M [16]. 

Certainly, the picture described for the spinning particle on the circular orbit with 
r = 3M holds in the ideal case only, when perturbations are neglected. Some cases 
with perturbations (noncircular motions) we shall consider below. 

We stress the important feature of the partial circular solution of the MP equations 
with (38): this solution is common for the exact MP equations and their linear spin 
approximation and, as a result, both for the Mathis son-Pirani and Tulczyjew-Dixon 
condition [14, 17]. 

Let us consider other circular orbits of a spinning particle in Schwarzschild’s 
background which are described by Eq. (38). The detailed analysis is presented in 
[ 22 ] and here we show some graphs which illustrate the domain of existence of 
the highly relativistic circular orbits and the dependence of the 7 -factor, which is 
necessary for realizing these orbits, on the radial coordinate. Figure 1 describes the 



Fig. 1 Lorentz factor versus 
r for the highly relativistic 
circular orbits with 
dif/ds < 0 of the spinning 
particle beyond the small 
neighborhood of r = 3M 
according to the exact MP 
equations 
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Fig. 2 Lorentz factor versus 
(5 for the highly relativistic 
circular orbits with 
dif Ids < 0 of the spinning 
particle in the small 
neighborhood of r = 3M 
according to the exact MP 
equations {solid line) and 
their linear spin 
approximation {dashed line) 




orbits with the negative values of the particle orbital velocity (yv < 0) at Sq > 0, i.e., 
the signs of these values are the same as in the case of the highly relativistic circular 
orbit with yi = 3 and yq from (38). (In these figures below we put eo = 10“^). The 
orbits corresponding to Fig. 1 exist in the space region 2M < r < 3M and, similarly 
to the circular orbit with = 3, these orbits are caused by the significant repulsive 
action of the highly relativistic spin-gravity coupling. However, in contrast to the 
orbit with y^ = 3, which is fully determined by the linear spin terms, for the orbits in 
Fig. 1 the nonlinear terms are important. In this context Fig. 2 shows the difference 
in the description of the highly relativistic circular orbits of a spinning particle in the 
small neighborhood of yi = 3, that is when yi = 3 -h (5, (5 < 0, |(5| ^ 1, by the exact 
MP equations and their linear spin approximation. 

According to the solutions of Eq. (36) for different yi , the highly relativistic orbits 
of a spinning particle in Schwarzschild’s background exist for r > 3M as well. 
However, in contrast to the orbits from 2M < r < 3M, they are caused by the 
significant attractive action of the spin-gravity coupling, and for realizing these orbits 
the sign of the particle orbital velocity must be positive {dcpjds > 0) for Sq > 0. 
Figure 3 describes the small space domain, where yi = 3-h(5, 0<(5^ 1, and Fig. 4 
corresponds to the region for the large values of yi . There is an essential difference 
between the upper and lower part of the solid curve in Fig. 3. Namely, the last curve 
is close to the dashed line for S > 0.0063 1 and both these curves tend to the geodesic 
line if 6 is growing, whereas the upper part of the solid curve in Fig. 3 significantly 
differs from the geodesic line. The dependence of the 7 on r/M for this case on the 
interval from 3.02 is presented in Fig. 4, and for r ^ 3M the value 7 is proportional 
to ^/r. It means that for the motion on a circular orbit with r ^ 3M the particle must 
posses much higher orbital velocity than in the case of the motion on a circular orbit 
near r = 3M. 

For deeper understanding of physics of the highly relativistic circular orbits of a 
spinning particle in Schwarzschild’s background, let us estimate the values of the 
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Fig. 3 Dependence of the 
Lorentz factor on (5 > 0 for 
the highly relativistic circular 
orbits with difIds > 0 of the 
spinning particle in the small 
neighborhood of r = 3M 
according to the exact MP 
equations and their linear 
spin approximation {solid 
line) and their linear spin 
approximation {dashed line). 
The dotted line corresponds 
to the geodesic circular orbits 




Fig. 4 Dependence of the 
Lorentz factor on r for the 
highly relativistic circular 
orbits with difjds > 0 of the 
spinning particle beyond the 
small neighborhood of 
r = 3M by the exact MP 
equations {solid line). The 
dotted line corresponds to 
the geodesic circular orbits 




particle’s energy E on these orbits. It is not difficult to calculate the values for E in 
the case of the circular orbits in the Schwarzschild background as [22] 



E = m 




T8 - soyiiyi - 




(39) 



It is easy to check that by (39) the energy of the spinning particle on the above 
considered highly relativistic circular orbits is positive and much less than the energy 
of the spinless particle on the corresponding geodesic circular orbits. For example, 
the energy of the spinless particle on the circular orbits with r > 3M tends to oo if 
r 3M, whereas according to (36) the energy of the spinning particle is finite for 
its circular orbits with any r, including r = 3M. In the case of the highly relativistic 
circular orbits of the spinning particle beyond the small neighborhood of 3M it 
follows from (39) that 
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Hence, by (40) we have E'^ (note that the right-hand side of Eq. (40) is positive 

for all values y\ beyond the horizon surface). That is, in this sense one can draw a 
conclusion concerning the strong binding energy for those orbits which is caused by 
the interaction of spin with the gravitational field. 



3.5 Circular Nonequatorial Orbits 



It is not difficult to verify that the MP equations in the standard coordinates of the 
Schwarzschild metric have the partial solution with 



r = const 7 ^ 0, 6 = const ^ 0, tt/2, tt. 



dip 



dt 



—— = const 7 ^ 0 , u = —- = const 7 ^ 0 , 
ds ds 



Sr = const jtz 0^ So = const 7 ^ 0 , = 0 , 

cos 6 > 



/ 3M\ 

5, (l-— 



r sin 0 



= 0 . 



(41) 

(42) 

(43) 

(44) 



Expressions (41)-(44) describe the nonequatorial circular orbits of a spinning particle 
with the constant orbital velocity when the spin lays in the meridional plane when the 
particle orbits the Schwarzschild mass. The necessary particle’s velocity for motions 
on these orbits is determined by 



u = — 



^1/4 _ 2M) 






r^/6s\ sinO\sign(So sin 0) 



[1 + 0 ( 5 )], 



where we note 



/ 2M\ ( 3M\ 



^ COS^ d 
sin^ d 



> 0 , 



(45) 



(46) 



and 5 is the small value as determined by (35). The dependence of the angle 0 on r is 



sin"^ 0=11 — 



X (1 + 0(£)) . 



(47) 
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For the nonequatorial circular orbits it is necessary 0 < sin^ ^ < 1. It follows from 
(47) that this condition is satisfied if and only if 

15 

y M (1 + 0(e)) < r < 3M (1 + 0(e)) . (48) 

Therefore, relationships (45)-(48) give the necessary and sufficient conditions 
for the motion of a spinning test particle on the nonequatorial circular orbits in the 
Schwarzschild background. According to (48), the space region of existence of these 
orbits by the radial coordinate is determined (without small correction of the order 
e) as 

15 

yM <r <3M. (49) 

It follows from (47) that the minimum value of sin^ 0 is achieved at 



and 



5V5 

3v^- 1 



2.35M, 



sin^ 0\n 



0.465, Orr 



43°. 



whereas at r = 235 M and r = 2.5 M wc have from (47) the common values 
sin^ 6» = 0.5 and 6» = 45°. 

We stress that for the realization of the considered nonequatorial circular orbits 
a spinning particle must posses the highly relativistic velocity, in the sense that 

7 ^ ^ 1/s > 1. 

Concerning the correlation between the direction of the particle orbital motion 
and its spin orientation we conclude from (45) that, for example, when sin ^ > 0 
(0 < < 7 t/ 2) the signs of Sq and = d(p/ds slyq opposite. 

The considered circular orbits one can called hovering, because a spinning particle 
hovers above the Schwarzschild mass under the angle ahover = 7 t/2 — 0. According 
to (44) if r is in the small neighborhood of the value 3M, the relationship | Sr I » 
takes place, i.e., in this case the particle’s spin practically lays in the equatorial plane 
and by (47) ahover is close to 0°. Whereas if r is in the small neighborhood of the 
value 15M/7, the relationship |5'6i| ^ \ Sr \ is valid, i.e., here the spin is practically 
orthogonal to the equatorial plane and ahover is close to 0°. Then it follows from 
(45) that if r ^ 15M/7 the expression for the particle’s orbital velocity pass the 
corresponding expression for the highly relativistic equatorial circular orbit with 
r = 15M/7. 

We stress that Eqs. (44)-(47) were obtained from the exact MP equations under 
the Mathis son-Pirani supplementary condition. The MP equations in the linear spin 
approximation have the solutions which describe the nonequatorial highly relativistic 
circular orbits of a spinning particle in Schwarzschild’ s background as well. However, 
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these solutions exist only in the small neighborhood of the radial coordinate r = 3M 
under the small hovering angle. 



3.6 Highly Relativistic Noncircular Orbits 

While investigating the highly relativistic noncircular orbits of a spinning particle 
in the Schwarzschild background it is necessary to integrate the set of the differen- 
tial Equations (31)-(33). Note that this system contains the two parameters E and J 
proportional to the constants of the particle’s motion: the energy and angular momen- 
tum. By choosing different values of these parameters for the fixed initial values of 
ji one can describe the motions of different centers of mass. To describe the proper 
center of mass of a spinning particle in the Schwarzschild background, the method 
of separation of the corresponding solutions of the exact MP equations was proposed 
and realized in explicit relationships for the cases when the initial radial component 
of the particle velocity is much less than the tangential one [18]. The correspond- 
ing motions are considered and discussed in detailed in [18, 22]. Here we present 
three typical figures which illustrate some important features of the highly relativistic 
noncircular nonequatorial motions of a spinning particle in Schwarzschild’s back- 
ground (more illustrations are in [20]). Figures 5 and 6 show the situations when the 
highly relativistic spin-gravity coupling causes the significant attractive and repul- 
sive action correspondingly (the circles with the radius 2 in these figures correspond 
to the horizon line). 

Figure 7 illustrates the cases of the nonequatorial motions when the particle’s 
spin is not orthogonal to the equatorial plane 0 = tt/ 2. More exactly, this figure 
describes 6>-oscillations for different values of the inclination angle of the spin to the 
equatorial plane due to Sr / 0 with = 0. All graphs in Fig. 7 are interrupted 
beyond the domain of validity of the linear spin approximation. Within the time 



Fig. 5 Trajectories in the 
polar coordinates of the 
spinning (solid line) and 
spinless particle (dotted line) 
with the same initial values 
of their 4-velocity 
components: the tangential 
and radial velocities are 
equal to ~ 35 and 10“^ 
correspondingly. The initial 
value of the radial coordinate 
is equal to lOM, eq = 10“^ 
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Fig. 6 Trajectories of the 
spinning {solid line) and 
spinless particle {dotted line) 
with the same initial value of 
the tangential velocity, which 
is equal to the spinning 
particle velocity on the 
circular orbit with r = 3M, 
and with much less radial 
velocity of order 10“^. The 
initial value of the radial 
coordinate is equal to 3M 
and eo = 10“^ 




of this approximation validity, the period of the 6>-oscillations coincides with the 
period of the particle’s revolution by ip. Whereas on this interval the value of the spin 
component is const, just as the components ^ 2 , S^, and the coordinate r. This 
situation differs from the corresponding case of the circular motions of a spinning 
particle that are not highly relativistic. Indeed, then the nonzero radial spin component 
is not const but oscillates with the period of the particle’s revolution by ip. Besides, 
in the last case the mean level of 0 coincides with 90°, whereas in Fig. 7 the mean 
values of 0 are above 90° . According to this figure the amplitude of the 6>-oscillation 
increases with the inclination angle. However, — 90° is small even for the inclination 
angle that is equal to 90° . 



3.7 A Restriction on Using the Tulczyjew-Dixon Condition 

Now we consider some interesting situation which arise when the highly relativistic 
solutions of the MP equations under the Tulczyjew-Dixon condition are studied [21]. 
Let us write the main relationships following from the MP equations at Tulczyjew- 
Dixon condition. The mass of a spinning particle p is defined as 

^ (50) 

and /i is the integral of motion. The quantity V ^ is the normalized momentum, where 
by definition 

A 

= . 



(51) 
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Fig. 7 Graphs of the angle 6 
vs proper time for the 
inclination angle 0° 
{horizontal line 6 = 90°), 

1° {solid line), 10° {dash 
line), and 90° {dash and dot 
line) at eq = 




s[M) 



As the normalized quantities and satisfy the relationships 



uxu^ = 1 , = 1 . 

There is the important relationship between and [40-42] 

1 



= N 



+ 



2fj?A 






UTTpCr 



Spa 



(52) 



(53) 



where 



A 



1 + 



(54) 



Now our aim is to consider the explicit form of expression (53) for the concrete 
case of the Schwarzschild metric, for the particle motion in the plane = tt/ 2 when 
spin is orthogonal to this plane (we use the standard Schwarzschild coordinates). 
Then we have 

m2 =0, M V 0, y^O, u‘^ ^ 0, (55) 



5*2 = 0 , = 0 , 5 *^ ^ 0 . 



(56) 



In addition to (56) by Tulczyjew-Dixon condition we write 



5*"* = - — 5*^, 5^"* = 0, 5^"*=— 5*^ 



Pa 



Pa 



(57) 
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Using corresponding expressions for the Riemann tensor in the Schwarzschild metric, 
from (53) we obtain 




According to (54) we write the expression for A as 

sIm , 

A= 1 + ^(1 -3^3 (59) 



(the quantity M is the mass of a Schwarzschild source). Inserting (59) into (58) we 
get 



u 



1 



u 



3 



U 



4 




(60) 



Similarly to (35), we note 



fir 



(61) 



where s ^ 1. 

The explicit expressions for N we obtain directly from the condition u\u^ = 1 
in the form 



N = A 




o 9 

3y3V^£^— X 



- 1/2 






( 62 ) 
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Inserting (62) into (60) we obtain the expression for the components through 

(y2 = u^ = 0): 

y' = 



^ =u^R(l-2e^yy 
y^ = u^R ^1 , 



y^ = 



(63) 



where 



R 






,M\ 









- 1/2 



(64) 



The main feature of relationships (63) and (64) is that for the high tangential 
velocity of a spinning particle the values become imaginary. Indeed, if 



\u^\ > 



1 



- V6Mr ’ 



(65) 



in (64) we have the square root of the negative value. (As writing (65) we neglect 
the small terms of order 6 ^; all equations in this section before (65) are strict in e.) 
Using the notation for the particle’s tangential velocity u± = ru^ by (65) we write 



|m_l| > — (66) 

e\/6M 

If r is not much greater than M, the velocity value of the right-hand side of Eq. ( 66 ) 
corresponds to the particle’s highly relativistic Lorentz 7 -factor of order \/e. 

This fact that according to (51), (63)-(66) the expressions for the components 
of 4-momentum become imaginary (if fi in (50) is real) is an evidence that 
the Tulczyjew-Dixon condition cannot be used for the particle’s velocity which is 
very close to the velocity of light. In this context it is clear the sense of a result of 
recent paper [44] concerning some solutions of the MP equations at the Tulczyjew- 
Dixon condition which allow acceleration of a spinning particle to the superluminal 
velocity in Schwarzschild’s background. This result is connected with the situation 
when relationship ( 66 ) takes place, i.e., when the Tulczyjew-Dixon condition cannot 
be applied. Under the Mathisson-Pirani condition the corresponding result is not 
allowed by the MP equations. 



Solutions of Mathis son-Papapetrou Equations for Highly Relativistic Spinning Particles 



187 



4 Highly Relativistic Solutions of MP Equations 
in Kerr’s Background 

Some solutions of the MP equations under the Mathisson-Pirani condition in Kerr’s 
background, which describe highly relativistic motions of a spinning particle, posses 
the feature common with the case of the corresponding motions in Schwarzschild’s 
background, both iov a M and a ^ M (here a and M are the rotation parameter 
and the mass of the Kerr metric source; we put a > 0). For example, far from 
the horizon surface, when r ^ Vg, the highly relativistic equatorial circular orbits 
of a spinning particle in Kerr’s background are allowed practically with the same 
dependence of the 7 -factor on r as in Fig. 4 for the Schwarzschild background [23] 
(in [23] the Boyer-Lindquist coordinates are used). In this context we note that for 
growing values r the spin-orbit and spin-spin interactions decrease as 1 /r^ and 1 /r^ 
correspondingly; therefore, for r ^ Vg the influence of the spin-orbit action on a 
particle is prevailing . Other situations arise when a particle is not far from rg. For 
example. Fig. 8 illustrates a case of the circular orbits in the narrow space domain 
near and (these values are the radial coordinates of the co- and counter- 
rotation photon circular orbits correspondingly) at the small value of the parameter 
a. In contrast to Fig. 3 for a = 0, in Fig. 8 there is the upper graph which arise just 
due to a 7 ^ 0, i.e., when spin-spin action is nonzero. Many other figures for different 
correlation of the signs a, Ids, and Sq are presented in [23]. 

Similarly to the case of Schwarzschild’s background, the MP equations have the 
highly relativistic nonequatorial circular orbits in Kerr’s background as well [13]. 
The space domain of these orbits existence increase with the increasing parameter < 2 . 
\f a = M, this domain is determined by 1.89M < r < AM. The maximum value of 
the hovering angle jl — d vs achieved aXa = M and is equal to ~ 63°, in contrast 
to ~ 47° at (2 = 0. 



Fig. 8 Dependence of the 
Lorentz factor on r for the 
highly relativistic equatorial 
circular orbits with 
dip/ds > 0 of the spinning 
particle in Kerr’s background 
dita = 0.0145M, £0 = 10“^, 
and > 0 (solid lines). The 
dotted line corresponds to the 
geodesic circular orbits. Here 
we use the notation r(— ) for 
andr(+)forr^+^ 




r(+) 2.989 



3.01 r(-) 
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Some graphs which describe nonequatorial motions (in particular, the 
6 >-oscillations) in Kerr’s background when its spin is not orthogonal to the equa- 
torial plane are presented in [ 20 ]. 



5 Numerical Estimates 

Do some particles in cosmic rays posses the sufficiently high 7 -factor for motions on 
the highly relativistic circular orbits, or on some their fragments, in the gravitational 
field of the Schwarzschild or Kerr black hole which are considered above? Yes, they 
do. By the numerical estimates for an electron in the gravitational field of a black 
hole with three of the Sun’s mass the value l^o is equal 4 x 10“^^. Then the necessary 
value of the 7 -factor for the realization of some highly relativistic circular orbits by 
the electron near this black hole is of order 10^. This 7 -factor corresponds to the 
energy of the electron free motion of order 10^"^ eV. Analogously, for a proton in 
the field of such a black hole the corresponding energy is of order 10^^ eV. For the 
massive black hole those values are greater: for example, if M is equal to 10^ of the 
Sun’s mass the corresponding value of the energy for an electron is of order 10^^ 
eV and for a proton it is 10^^ eV. Naturally, far from the black hole these values are 
greater because the necessary 7 -factor is proportional to yY. 

Note that for a neutrino near the black hole with three of the Sun’s mass the 
necessary values of its 7 -factor for motions on the highly relativistic circular orbits 
correspond to the neutrino’s energy of the free motion of order 10^ eV. If the black 
hole’s mass is of order 10^ of the Sun’s mass, the corresponding value is of order 
10 ^ eV. 

Can the highly relativistic spin-gravity effects be registered by the observation of 
the electromagnetic synchrotron radiation from some black holes? Perhaps, however, 
it is difficult to differ the situation when the circular orbits of a spinning charged 
particle and its synchrotron radiation are caused by the magnetic field or the spin- 
gravity coupling. The detailed analysis of the observational data is necessary. 



6 On Correspondence Between the Dirac and MP Equations 

Here we point out a question which arise after the comparison of the MP equa- 
tions with the general relativistic Dirac equation. As we note in the Introduction, 
the first equations follow from the second equation in some classical approxima- 
tion. However, this result has been obtained in the linear spin approximation only, 
when expression (3) for the Fermi transported spin practically coincides with the 
corresponding expression for the parallel transport. Note that the general relativistic 
Dirac equation was obtained for eight years before the MP equations and therefore 
the Fermi transport was not discussed in the context of possible description of a 
quantum spinning particle in the gravitational field. At the same time, if one want to 
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satisfy the principle of correspondence between an equation for a quantum spinning 
particle and the MP equations, now it is necessary to take into account the known fact 
that according to the MP equations the spin of a test particle is transported by Fermi. 
However, in common sense, the notation “Fermi-transported spinor” cannot be intro- 
duced without violation of the Lorentz invariance. In this context it is interesting that 
for last years the possibility of the Lorentz invariance violation is discussed in the 
literature from different points of view. In any case (with violation of the Lorentz 
invariance or not), it is necessary to propose a more exact equations for fermions in 
gravitational field than the usual general relativistic Dirac equation (probably, this 
corrected equation must be nonlinear in the t/; -function [45]). 



7 Conclusions 

1 . The MP equations are an important source (in the certain sense, even the unique 
source) of information on highly relativistic motions of spinning particles in the 
gravitational field. 

2. Different solutions of the MP equations in Schwarzschild’s and Kerr’s back- 
ground show clearly that the highly relativistic spin-gravity coupling can change 
significantly the trajectory of a spinning particle as compare to the correspond- 
ing geodesic trajectory of a spinless particle. In particular, the highly relativistic 
spin-orbit interaction reveals the strong repulsive action on a spinning particle. 

3. In the highly relativistic regime of the spinning particle motions in the gravita- 
tional field the adequate supplementary condition for the MP equations is just the 
Mathisson-Pirani condition. 

4. By the numerical estimates, some particles in cosmic rays posses a sufficiently 
high 7 -factor for motions on the significantly nongeodesic orbits near Schwarz- 
schild’s or Kerr’s black holes. 

5. The MP equations can be useful for searching adequate corrections in the known 
general relativistic Dirac equation with the aim to investigate the highly relativistic 
spin-gravity coupling on the quantum level. 



References 



1. M. Mathisson, Acta Phys. Pol. 6, 163 (1937) 

2. A. Papapetrou, Proc. R. Soc. A 209, 248 (1951) 

3. E. Corinaldesi, A. Papapetrou, Proc. R. Soc. A 209, 259 (1951) 

4. A. Das, Prog. Theor. Phys. 17 , 373 (1957) 

5. R. Micoulaut, Z. Phys. 206, 313 (1967) 

6. J.K. Lawrence, Acta Phys. Austr. 30, 373 (1969) 

7. A.R. Prasanna, N. Kumar, Progr. Theor. Phys. 49, 1553 (1973) 

8. C.W. Misner, K.S. Thome, J.A. Wheeler, Gravitation (Ereeman, San Erancisco, 1973) 

9. S. Rasband, Phys. Rev. Lett 30, 111 (1973) 



190 



R. Plyatsko et al. 



10. R. Wald, Phys. Rev. D 6, 406 (1972) 

11. R.M. Plyatsko, Preprint No. 158, Phys. Inst. Belor. Acad. Sci. (1978). (in Russian) 

12. R.M. Plyatsko. Cand. thesis, Phys. Inst. Belor. Acad. Sci., (1979). Minsk (unpublished) 

13. R.M. Plyatsko, A.L. Vynar, Sov. Phys. Dokl. 27, 328 (1982) 

14. R.M. Plyatsko, Manifestations of gravitational ultrarelativistic spin-orbit interaction, Naukova 
Dumka, Kyiv, (1988) (in Ukrainian) 

15. R.M. Plyatsko, Phys. Rev. D 58, 084031 (1998) 

16. R. Plyatsko, O. Bilaniuk, Class. Quantum Gravity 18, 5187 (2001) 

17. R. Plyatsko, Class. Quantum Gravity 22, 1545 (2005) 

18. R. Plyatsko, O. Stefanyshyn, Acta Phys. Pol. B. 39, 23 (2008) 

19. R. Plyatsko, Acta Phys. Pol. B. Proc. Suppl. 1, 173 (2008) 

20. R. Plyatsko, O. Stefanyshyn, M. Fenyk, Phys. Rev. D 82, 044015 (2010) 

21. R.M. Plyatsko, O.B. Stefanyshyn, M.T. Fenyk, Class. Quantum Gravity 28, 195025 (2011) 

22. R. Plyatsko, M. Fenyk, Phys. Rev. D 85, 104023 (2012) 

23. R. Plyatsko, M. Fenyk, Phys. Rev. D 87, 044019 (2013) 

24. F.A.E. Pirani, Acta Phys. Pol. 15, 389 (1956) 

25. C. Mpller, Commun. Dublin Inst. Adv. Stud., A5, 3 (1949) 

26. B. Mashhoon, J. Math. Phys. 12, 1075 (1971) 

27. S. Wong, Int. J. Theor. Phys. 5, 221 (1972) 

28. L. Kannenberg. Ann. Phys., 103, 64 (1977), (New York) 

29. R. Catenacci, M. Martellini, Lett. Nuovo Cimento 20, 282 (1977) 

30. J. Audretsch, J. Phys. A 14, 411 (1981) 

31. A. Gorbatsievich, Acta Phys. Pol. B 17, 111 (1986) 

32. A. Barut, M. Pavsic, Class. Quantum Gravity 4, 41 (1987) 

33. F. Cianfrani, G. Montani, Europhys. Lett. 84, 30008 (2008) 

34. E. Cianfrani, G. Montani, Int. J. Mod. Phys. A 23, 1274 (2008) 

35. Yu.N. Obukhov, A. Silenko, O. Teryaev, Phys. Rev. D 80, 064044 (2009) 

36. V. Pock, D. Ivanenko, Z. Phys. 54, 798 (1929) 

37. V. Pock, Z. Phys. 57, 261 (1929) 

38. H. Weyl, Proc. Natl. Acad. Sci. USA 15, 323 (1929) 

39. K. Thome, J. Hartle, Phys. Rev. D 31, 1815 (1985) 

40. K.P. Tod, P. de Pelice, Nuovo Cimento 34, 365 (1976) 

41. R. Hojman, S. Hojman, Phys. Rev. D 15, 2724 (1977) 

42. S. Suzuki, K. Maeda, Phys. Rev. D 58, 023005 (1998) 

43 . S . Chandrasekhar, The Mathematical Theory of Black Holes (Oxford University Press, Oxford, 
1983) 

44. S.A. Hojman, FA. Asenjo, Class. Quantum Gravity 30, 025008 (2013) 

45. R. Plyatsko, arXiv: gr-qc/0601 1 1 ; arXiv: 1 1 10.2386 



The MPD Equations in Analytic 
Perturbative Form 



Dinesh Singh 



Abstract A longstanding approach to the dynamics of extended objects in curved 
space-time is given by the Mathisson-Papapetrou-Dixon (MPD) equations of motion 
for the so-called “pole-dipole approximation.” This paper describes an analytic per- 
turbation approach to the MPD equations via a power series expansion with respect 
to the particle’s spin magnitude, in which the particle’s kinematic and dynamical 
degrees of freedom are expressible in a completely general way to formally infinite 
order in the expansion parameter, and without any reference to pre-existent space- 
time symmetries in the background. An important consequence to emerge from the 
formalism is that the particle’s squared mass and spin magnitudes can shift based 
on a classical analogue of “radiative corrections” due to spin-curvature coupling, 
whose implications are investigated. It is explicitly shown how to solve for the linear 
momentum and spin angular momentum for the spinning particle, up to second order 
in the expansion. As well, this paper outlines two distinct approaches to address the 
study of many-body dynamics of spinning particles in curved space-time. The first 
example is the spin modification of the Raychaudhuri equation for worldline con- 
gruences, while the second example is the computation of neighbouring worldlines 
with respect to an arbitrarily chosen reference worldline. 



1 Introduction 



It is an undeniable fact that, for all practical purposes, macroscopic astrophysical 
objects in the Universe possess classical spin angular momentum during their for- 
mation and long-term evolution. In addition, it has been known for a long time that 
relativistic astrophysical objects, such as black holes and neutron stars with spin, also 
theoretically exist within the theory of General Relativity, with strong observational 
evidence to effectively confirm their presence within the Universe. Further to this, it 
becomes very relevant to consider, for example, the physical consequences involving 
the spin interaction of relativistic systems, such as the orbital dynamics of rapidly 
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spinning neutron stars around supermassive black holes expected to exist in the 
centres of most galaxies. Therefore, it is vital to properly understand the dynamics 
of extended bodies in curved space-time that incorporates classical spin. 

The first attempt to introduce classical spin within the framework of General 
Relativity was presented by Mathisson [1], who demonstrated the existence of equa- 
tions of motion involving an interaction due to the direct coupling of the Riemann 
curvature tensor with the moving particle’s spin. Besides the so-called “pole-dipole 
approximation” for the case of a spinning point dipole as the leading order spin- 
dependent term alongside the mass monopole term, Mathisson ’s formalism includes 
a non-trivial set of higher-order multipole moment contributions that depend upon 
prior knowledge of the object’s matter-energy tensor field. Several other contribu- 
tions over the years also appear, most notably from Papapetrou [2] whose treatment 
of the problem was to describe the spinning object confined to within a space-time 
world tube that contains its centre-of-mass worldline, such that the associated matter 
field is given compact support. Other contributions, such as from Pirani [3], Tulczy- 
jew [4], Madore [5], and several more have adopted competing perspectives on the 
treatment of higher-order multipole moments. Some years later, Dixon [6, 7] intro- 
duced what is arguably the most complete treatment of this problem by providing 
a fully self-consistent description of all multipole moment contributions to infinite 
order for the dynamics of extended bodies in curved space-time. 

When considering the inspiral orbital motion of an equal-mass spinning binary 
neutron star system, for example, it is reasonable to demand an understanding of 
the higher-order multipole moment contributions when strong tidal disruptions are 
expected to occur to put stresses within the neutron stars’ internal structure. How- 
ever, for situations involving extreme mass-ratio systems, such as neutron stars or 
solar-mass black holes in orbit around supermassive black holes, it is justifiable to 
truncate the multipole moment expansion of the full equations of motion and focus 
exclusively on the pole-dipole approximation. Indeed, for most practical calcula- 
tions involving an extreme mass ratio system, the essential physics of the dynamical 
motion is well-captured, provided that the spinning object’s dimensions are suffi- 
ciently small compared to the background space-time’s local radius of curvature in 
order for this justification to be well-motivated. Taking such an approach leads to 
what are commonly known as the Mathis son-Papapetrou-Dixon (MPD) equations of 
motion. 

The application of the MPD equations for a variety of interesting astrophysical 
situations is evident within the literature. For obvious reasons, it is very natural to 
make use of the MPD equations to describe the dynamics of a spinning particle 
around spinning black holes, as described by the Kerr metric [8-12] to showcase 
the impact of spin-curvature interactions involving black hole spin on the spinning 
particle’s long-term orbital motion. In so doing, the particle then becomes a sensitive 
probe of space-time curvature due to the MPD equations, from which it becomes 
possible to learn a great deal about the properties of the background source. From 
numerical simulations in particular, it is possible to explore the limits of stability 
for the MPD equations, such that deterministic chaos may result under extreme 
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circumstances [12-15]. Furthermore, it is possible within this framework to derive 
predictions of gravitational wave generation [16, 17] expected to occur from spin- 
induced deviations away from geodesic motion. 

Besides the numerically driven treatments of the MPD equations, there also exist 
more formal analytic studies within the literature [18-20]. The direct precursor to 
the analytic perturbative approach presented in this article was first developed by 
Chicone, Mashhoon, and Punsly (CMP) [21], with a focus upon the study of spinning 
clumps of plasma in the formation of as trophy sical jets columnated along a Kerr black 
hole’s axis of symmetry. For future reference, this approach to the MPD equations 
is hereafter known as the CMP approximation. This approach was then applied 
by Mashhoon and Singh [22] to identify analytic expressions for the leading-order 
perturbations for the circular orbit of a spinning particle around a Kerr black hole. 
Given the somewhat basic assumptions within the CMP approximation, the analysis 
is remarkably successful in reproducing the spinning particle’s kinematic properties 
when compared with the full MPD equations. This is self-evident for situations where, 
for spin magnitude 5' and mass m, with radial distance r away from the background 
source, the spinning particle’s Mpller radius [22, 23] is confined to < 10“^ r. 
This agreement, however, begins to break down when s /{mr) ~ 10“^ — 10“^ for 
r = 10 M, where M is the Kerr black hole mass. It suggests strongly that higher- 
order spin-curvature coupling terms are necessary to more completely account for the 
orbital motions when compared with the numerical treatment of the MPD equations. 

On the basis of this observation, a full generalization of the CMP approximation 
was introduced by Singh [24] to account for the need to have higher-order analytic 
contributions within the perturbation treatment. As will be shown within this paper, 
several attractive features emerge with the additional terms incorporated. First, the 
formalism in terms of a power series expansion with respect to the particle’s spin 
magnitude is extended to formally infinite order. Second, it is background indepen- 
dent and accommodates for arbitrary motion of the particle without any recourse to 
space-time symmetries within the metric. As such, it follows that this generalization is 
very robust and can be applied to a wide variety of astrophy sical situations, examples 
of which are the modelling of globular clusters and other many-body dynamical sys- 
tems in curved space-time. Explicit applications of this perturbation approach were 
presented by Singh to model the circular motion of a spinning test particle around a 
Kerr black hole [25] and that of a radially accreting or radiating Schwarzschild black 
hole, as described by the Vaidya metric [26]. Third, the formalism allows for the 
existence of a classical analogue for “radiative corrections” that shift the particle’s 
overall squared mass and spin magnitude with respect to some “bare mass” and “bare 
spin,” respectively. Such an observation under extreme conditions may well yield the 
ability to analytically determine the transition from stable to chaotic motion, which 
can be compared to existing studies [12-15] that involve numerical methods. 

This paper presents the analytic perturbation treatment of the MPD equations 
based on the following outline. It begins in Sect. 2 with a general presentation of the 
MPD equations and its properties. Following that. Sect. 3 presents the perturbation 
approach based upon the CMP approximation. Afterwards, Sect. 4 gives a descrip- 
tion of how to explicitly obtain the linear momentum and spin angular momentum 
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terms for a spinning test particle, up to second order in the perturbation expansion. 
Prior to the concluding remarks, Sect. 5 presents two lines of study for a many- 
body extension of the MPD equations that can incorporate the analytic perturbation 
approach. The first one concerns spin-induced modifications of the Raychaudhuri 
equation for a congruence of worldlines, while the second one involves the compu- 
tation of neighbouring worldlines for spinning particles determined with respect to 
a reference spinning particle. In order to account for the anticipated spin-spin inter- 
actions to appear within this many-body extension, a novel approach is employed in 
the form of introducing a spin-induced torsion field satisfying the Riemann-Cartan 
1/4 geometry along the reference particle’s worldline, details of which are explained 
in due course. 

Throughout this paper, the metric convention is 4-2 signature with geometric 
units of G = c = 1 utilized. The Riemann tensor and its contractions are defined in 
accordance with Misner et al. [27]. 



2 The MPD Equations in General Form 
2.1 Equations of Motion 



Assuming the pole-dipole approximation for a spinning pointlike object, the MPD 
equations for its linear four-momentum P^(r) and spin tensor S^^{r) are described 
by 



dr 2 


(1) 


dr 


(2) 



where (1) is the spin-curvature force equation due to the coupling of the Riemann 
curvature tensor with plus the particle’s four- velocity vector w^(r) = 

dv^(r)/dr with affine parametrization r, while (2) is the wedge product of the four- 
velocity with the four-momentum. An important consequence of the MPD equations 
is that the particle’s four-momentum necessarily precesses around the centre-of-mass 
worldline. The affine parameter r is formally left unspecified, but it is always possible 
to have it identified with proper time, such that = — 1 . When going beyond the 
pole-dipole approximation, (1) and (2) each have additional terms of the form and 

q-ap [18, 22], respectively. These contributions in turn require detailed knowledge of 
the spinning object’s energy-momentum tensor [6-8, 22], satisfying covariant 
conservation = 0. For the purposes of this paper, however, the expressions 

for (1) and (2) are sufficient. 
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2.2 Constraint Equations 



By themselves, the MPD equations (1) and (2) are underdetermined. They require 
additional equations to specify the system, up to a parametrization constraint for r. 
Specifically, it is necessary to determine some type of relationship involving the 
spin tensor that allows for the spinning particle’s evolution to be understandable. 
Arguably, the most “reasonable” spin condition to be utilized, following Tulczyjew 
14] and Dixon 16, 7] is to enforce orthogonality between the spin tensor and the linear 
four-momentum, such that 



5 "^ Pf3=0. 



( 3 ) 



It follows from use of both (3) and the MPD equations that the mass and spin 
magnitudes m and 5', defined according to 

m2 = 

0 1 

.2 _ _ o rfliy 
— 2 ’ 

are both constants of the motion 121]. 

In addition to this fact, use of (3) gives rise to determining the four-velocity 
with respect to and according to Tod et al. 110], such that 



( 4 ) 

( 5 ) 






I 1 P^ 

2 + I SP\ 



( 6 ) 



As noted above, the only remaining undetermined quantity is P • u, which relates 
the particle’s internal clock with respect to r. While solving for P • u in terms of 
the normalization equation Uf^ = —1 is obviously possible, it is also possible to 
impose a different constraint on r, since the choice for P -u has no effect on the locus 
of the particle’s worldline. For example, if it is desirable to measure time evolution 
in terms of a distant observer’s clock, then letting = dt /dr = I and solving for 
P • M is a perfectly valid procedure to follow. 

All these equations presented above will become useful for deriving the analytic 
perturbative approach to the MPD equations, based upon the CMP approximations 
to be presented next 124] . 



3 Analytic Perturbation Approach to the MPD Equations 
3.1 CMP Approximation 



The approximation of the MPD equations first introduced by Chicone et al. [21, 22] 
is based upon the assumption that PP — mu^ = is small, where is the spin- 
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curvature force. Further to this, the M0ller radius p [21-23] is chosen to be a small 
quantity, in terms of p = s/m where r is the distance from the particle to the 
source of the gravitational background. When taken together, the CMP approximation 
emerges as a series expansion to first-order in 5', in the form 



According to (8), the spin tensor is parallel transported and the spin condition (3) 
becomes 



which agrees with the Pirani condition [3] for orthogonality between the spin tensor 
and the particle’s four- velocity. 

It is surprising to note that the CMP approximation is remarkably accurate for 
modelling the motion of a spinning test particle in curved space-time, with spin 
orientation as given in Fig. 1. This is illustrated very well when applied to circular 
motion around a Kerr black hole [22] and compared to the full MPD equations for 
s/{mr) ~ 10“^ and r = 10 M. However, when pushed to the more extreme situation 
of (mr) ~ 10“^ — 10“^ for the same choice of r, abreakdown appears, as evidenced 
by Fig. 2 with the lack of any spin-induced modulation of the particle’s r-dependent 
radial position using the CMP approximation compared to the full MPD equations. 
This observation justifies the need for a generalization of the CMP approximation to 
be given below [24] . 




(7) 



( 8 ) 






(9) 



z 




y 



X 



Fig. 1 Initial spin orientation (O, />) for a spinning particle in circular motion around a black hole. 
The observer is located along the x-axis 
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Fig. 2 The orbital radial co-ordinate r^(r) for s/{mr) = 10 \ where m = 10 ^ M, r = lOM, 
6 = ^ = 7t/4, and a = 0 . 50 . a s/{mr) = 10 “^ (MPD). b s/{mr) = 10 “^ (Linear) 



3.2 Generalization of the CMP Approximation 

To develop the generalization of the CMP approximation [24, 25], the linear momen- 
tum and spin angular momentum are expressed in terms of a power series expansion 
in the form 



j=o 

OO OO 

. ( 11 ) 

j=0 j=i 

For (10) and (1 1), 6 is an expansion parameter that is identified with 5', with and 

the respective jth-order contributions in 5 of the linear momentum and spin 
angular momentum. For the zeroth-order expressions in 5, they correspond to the 
dynamics of a spineless particle in geodesic motion. In addition, it is also assumed 
that the four- velocity is described as 



u^(e) u'^jy ( 12 ) 

;=o 

When substituting (10), (11), and (12) into the MPD equations expressed as 
dr z 



£>5“^(£r) 



= 2eP^^{e) «PJ(e) 






(13) 

(14) 



dr 
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where for consistency an extra factor of 6 is introduced in (14), the jth-order expres- 
sions of the MPD equations become 



DP[ 



U) 



of^ 

( 0 ) 



^(0) 



dr 


2 


^‘^(7-1) 

dr 


7-1 

= 2Z 


k=0 


, where 





7-1 



k=0 



^a(3 



/^] 



^0 — ^( 0 ) ’ 



it follows that the zeroth-order term in 5 is 



DP^ 

^^(0) 

dr 



0 , 



(15) 

(16) 



(17) 



(18) 



while the respective first-order terms for the linear momentum and spin are then the 
CMP approximation, since 



DP 



(1) 



dr 



^^(0) 

dr 



2 ^ uaP ^(0) ‘^(O) ’ 

0 . 



(19) 

( 20 ) 



To complete the generalization, it is necessary to determine as a function of 
the linear and spin angular momentum expansion terms. To do this requires use of 
the supplementary equations (3)-(6), with important consequences to follow. From 
the spin condition (3), it is straightforward to show in terms of (10) and (11) that 

7 

p(0) p{k) /9i\ 

k=l 

for the (j -h l)th-order contribution in 5, where the first-order perturbation in 5 is 

( 22 ) 

It is possible to identify a classical analogue for a bare mass mo defined by (17) and 
a bare spin according to 




c(0) 

^(0) ’ 



(23) 
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in analogy with the radiative corrections identified with the bare mass of quantum 
field theory, given that the m and are dependent upon and 5^^, which are 
represented by (10) and (11), respectively. Therefore, it follows that the total mass 
and spin magnitudes can be identified as “radiative corrections” to mq and sq, such 
that 



are dimensionless jth-order corrections to and Sq, respectively. Since the and 

s'^ are already constant within the exact set of MPD equations, it follows that and 

sj must be individually constant for each order of 6. 

With the four- velocity parametrized by 5 in the form 



it can be expanded out once the yet undetermined scalar product P • u is specified. 
A particularly useful choice is 




(24) 



(25) 



where 




(26) 




(27) 




(28) 



^ 1 - 



(29) 



P ’ u = 



(30) 



which leads to 



M^(e)M^(e) = -1 + 



1 



Ru{e) Ri^ie) 



1 “h 7 ^ Rn 

4m6(e)A2(e) ^ 

1 + 0(eb, 



(31) 
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where 



Ri^ie) ^ S>^\e)R^^ai5 PHe) S^^{e) . (32) 



Therefore, to at least third-order in 5 , it is shown from (31) that r is the proper 
time for parameterization of the particle’s centre-of-mass worldline in accordance 
with (30). A resulting tedious but otherwise straightforward calculation shows from 
substituting (10), (1 1), and (24) into (28) that the spinning particle’s four- velocity is 



'(e) - V e-' - — + e\—(p^ 

” [mo ( W 2 1 (OVJ 



^2 pM _ 

^( 2 ) 9 "^1 ^( 1 ) 



_|_ J? r, p'^ I 

^ Irn^ 



- 5 (“3 - 5 “i “2 + J ™l) '’(0)] 



( 1 ) 



+ 



1 

2m, 



3 ^h"jal3 



2 

p7 _ _ m2 p7 c«/^ 

‘^(1-n) Un-k) ^ik) 2 ^ (0) ^(0) ^(0) 



.n=0 



k=0 



+ 0(s^) „ 



(33) 



which satisfies (31) to third order in 5 . 

At this point, the worldline of the spinning particle can be explicitly determined by 
simple integration of (33), since it provides an iterative means to explicitly evaluate 
uf^(e) order by order in 6, based upon solving (15) and (16) in terms of quantities 
defined for lower orders of 6. 



3.3 Perturbations of the M0ller Radius and Stability 
Implications for Particle Motion 



An important study with regard to the analytic perturbative approach to the MPD 
equations involves the study of transitions from stable to chaotic motion for spinning 
particles in curved space-time. From strictly numerical considerations [12-15], there 
is ample evidence to expect that there exists a critical point when, for sufficiently 
large 5', the motion becomes chaotic. Indeed, when considering many-body particle 
dynamics for astrophysical systems, the importance of detecting chaotic dynamics is 
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very relevant. The pertinent question then concerns how to identify from presumably 
knowable and independent astrophysical parameters, such as the mass and spin of 
each constituent particle, the precise combination of factors that can trigger the 
transition from stable to chaotic motion. 

It is understood from, for example, studying the case of a single MPD particle in 
circular orbit around a Kerr black hole [22] that the relevant parameter for determining 
the relative strength of the spin-curvature interaction to deviate from strictly geodesic 
motion is the Mpller radius p — sIm. When p ^ r for the particle orbit’s radius of 
curvature, the capacity for entering an unstable region of phase space is minimized. 
For the choice of astrophysically realistic spins for MPD particles under numerical 
simulations [14, 15], it is shown that the particle motion never approaches instability. 

Until now within this formalism, it is unclear how to extract useful analytic infor- 
mation about stability properties of MPD particle motion. Given the fact that the 
mass and spin parameters can shift in magnitude according to (24) and (25), this 
will have an impact upon the strength of the Mpller radius, which in turn impacts 
upon the capacity for the spin-curvature interaction to induce unstable motion under 
potentially realistic physical situations. Therefore, determining the perturbations in 
p in analytic form may provide useful insight in determining the precise conditions 
to generate the transition from stable to chaotic motion for a single MPD particle. 

It is straightforward to show from using (24) and (25) that 






s{e) 

m(s) 



so 

mo 



6 S 




+ S 



3 




Si mi — 









+ 0(eb 



(34) 



where the “radiative corrections” due to the second- and third-order contributions 
in 5 cause a shift away from the zeroth-order Mpller radius po = so /mo defined 
by the “bare mass” mo and “bare spin” sq. What is important to note — perhaps 
not surprisingly — is that the corrections mj and sj counteract each other’s effects, 
at least on a formal level. The precise nature of the shift from po to p requires 
determining (34) in reference to a specific background and appropriately defined 
initial conditions. However, it is clearly evident that variations in the Mpller radius 
are precisely identifiable, which can then be correlated with the computation of 
Lyapunov exponents and Poincare maps, for example, while performing stability 
analysis studies using standard techniques available. 
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4 Evaluation of the Linear Momentum and Spin 
Angular Momentum Terms 

4.1 Local Fermi Co-ordinate Frame 



With the formalism behind the analytic perturbative approach to the MPD equations 
now established, it is possible to explicitly evaluate the linear momentum and spin 
angular momentum expansion terms [25] in (10) and (11). As noted earlier, this is 
accomplished in an iterative fashion by solving the first-order perturbation terms 
with respect to zeroth-order quantities, followed by solving the second-order and 
higher terms with respect to lower-order quantities. It becomes a very straightfor- 
ward process to achieve this by framing the problem in terms of the tetrad formalism 
and Fermi normal co-ordinates [22], with the obvious benefit of expressing all con- 
tributions with respect to a freely falling worldline in a locally fiat background. The 
metric deviations in Fermi normal co-ordinates to leading order are then proportional 
to the projected Riemann curvature tensor evaluated on the worldline. 

To proceed, first start with an orthonormal tetrad frame with the orthogonality 
condition 



such that the tetrad satisfies parallel transport in the general space-time background 
described by co-ordinates with 



dr 



= 0 , 



(36) 



and where a denote indices for the Fermi co-ordinates on a locally fiat tangent 
space. The Riemann curvature tensor in the Fermi frame is then described by 

^ ~ • ( 37 ) 



4.2 Zeroth-Order Contributions to the Linear 
Momentum and Spin Angular Momentum 

By ensuring that in the usual fashion and making use of (35) along with 

the leading-order spin condition 5^^ = 0, it becomes evident that 

‘ 3 ( 0 ) — A I A J 0 ( 0 ) , 



(38) 

(39) 
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where = mq and is a constant- valued spatial antisymmetric tensor, with 

components to be determined from initial conditions. 



4.3 First-Order Perturbation Terms 

For the linear momentum, the first-order perturbation in 5 is very straightforward 
to evaluate. With (19) and (36), it is obvious that DP^^^/&r = 
resulting in 



Apa 

dr 



= --^R^ 



' cU 

or; ‘^(O) • 



This is immediately integrable, leading to 



P^ — — - A^- 

^(1) - 2 ^ 



4 ) 



dr . 



(40) 



(41) 



Because of the tetrad formalism employed within this evaluation, it follows imme- 
diately upon contracting (41) with P^^ that 

= 0 (42) 



for a general space-time background. This immediately leads to simplified expres- 
sions for (33) and (34), expressed as 



00 

u^{e) = [^( 0 ) ^ 

2 '"-'®] + 5 ^ 

1 , 

2 "‘ 2 ''( 1 ) - 2 "* 3 ^( 0 ) 



i=o 
+ £2 



1 

mo 

1 






2ml 



mo 



nU 1 -2 dM ^ -2 nfj' 

^(2") r ^9 r Pf( 






n=0 



y ^=0 



+ 0(£^) 



(43) 



and 



P(S) = 



mo 



e + e 



^-2 

2 "! 



+4 





+ 0(eh 



(44) 
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In contrast to the linear momentum to first-order, the corresponding expression 
for the spin tensor is considerably more complicated to evaluate. It is important first 
to note that, following (16) for j = 2, 



D 

^^(1) 

dr 



= 0 . 



(45) 



When 5'^*^ is described in terms of its tetrad projection, it is determined that 



\li \y 

^(1) — ^ a ^ ^ ^(1) 

= + (46) 

By using the spin condition relation (21) for 7 = 1 and the tetrad projection for (41), 
it is shown that 




1 

mo 



p(l) c^J 



(47) 



Regarding the components in (46), they can be formally determined with respect 
to the “radiative correction” term according to (27), such that 




^-2 



‘^(O) ' 



(48) 



At this point, a problem emerges with this approach, in that (46) still depends upon 
sj, which remains an undetermined parameter to be computed. Given the discovery 
that m^ = 0 for the first-order mass shift correction, it would be tempting to simply 
make the same assumption about the corresponding spin shift correction. However, 
since s\ only needs to be covariantly constant, such an assumption is not well justi- 
fied. This leads to the requirement of explicitly evaluating (45) independently of the 
tetrad in terms of the six-dimensional matrix differential equation^ 



DS\ 



( 1 ) 






dS, 



( 1 ) 

fliy 



dr 



+2<0) 



pP e(l) _ 0 



for a column vector of 



(sff) = (. 



_ /c(l) c(l) e(l) c(l) e(l) e(l) 



c 

^01 ’ ^02 ’ ^03 ’ ^12 ’ ^23 ’ ^31 

,ap 



')■ 



(49) 



The solution to 



(49) is then used to evaluate the tetrad projected spin tensor and sf. 



^ A previous approach was reported in [25, 26] using a three-dimensional column vector of purely 
spatial components of in combination with the spin condition to algebraically evaluate the 
remaining three components. In retrospect, this previous treatment is not viable in comparison to 
the approach presented in this paper. Therefore, this new perspective replaces the older one presented 
in [25, 26]. 
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4.4 Second-Order Perturbation Terms 

Once the challenge of evaluating the first-order perturbation term for the spin tensor 
is overcome, the second-order perturbation terms are easily determined. For example, 
from solving (15) for j = 2, the second-order linear momentum term is 

< 2 , = -5 ^'4/ 4 , 4 + 4) ■ <50) 

which is the sum of terms with tetrad projections in both along the worldline and 
orthogonal to it. Therefore, it follows from (26) for j = 2 that m 2 7 ^ 0 in general. 
Similarly, for the spin angular momentum, the corresponding second-order expres- 
sion for (16) is 



D 

^^( 2 ) 

dr 



pU^ r^lcf D p7 oCf/5 

^3 (0) ^(0) ^cT'yap ^(Q) ^(0) ’ 

rrlr. 



which when integrated leads to 



_ 

^^( 2 ) “ 






J F 



R 



S^L dr . 



(0) ^(0) 



(51) 



(52) 



As well, S 2 7 ^ 0 in general. Further evaluation to higher orders in 5 also suggests 
no obvious challenges to overcome, so the procedure can be extended indefinitely to 
any desired order. 



5 Moving Towards an MPD Many-Body Description 
of Classical Spinning Particle Dynamics 

5.1 Research Motivations 



It would be very interesting to develop an extension of the analytic perturbative 
approach for the MPD equations that can be applied towards a many-body description 
of classical spinning particle dynamics. This is important to understand because 
virtually every composite astrophysical system, such as spiral galaxies, globular 
clusters, galactic superclusters, accretion disks around black holes, and so forth are 
all built in terms of components with spin angular momentum incorporated. These 
systems are often modelled using only many-body Newtonian gravitation, which 
may be a reasonable assumption under certain predetermined circumstances, but 
seems inadequate to address the full range of physical consequences that may be 
experienced due to relativistic effects that are necessarily excluded. 
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The essential intent for investigating a many-body description of classical spin- 
ning particles in curved space-time is to better understand the interaction dynamics 
between two or more spinning particles in relative motion with respect to one another. 
Given the interest shown in understanding the transition from stable to chaotic motion 
for a single particle, this consideration becomes even more relevant than before when 
dealing with a many-body system in curved space-time, in which the possibility of 
instabilities may emerge under astrophysically realistic conditions. As well, by com- 
pressing the many-body system into a very small volume of space, it seems possible 
to investigate the continuum limit for spinning matter subject to the MPD equations 
as applied to each unit cell endowed with spin. 

This section presents the first stage of some long-term research [28] on under- 
standing the extension of the MPD equations to incorporate many-body spinning 
particle systems. For now, it involves understanding the evolution of a single neigh- 
bouring worldline due to spin, determined with respect to some reference worldline 
in the following two contexts: 

1. Ray chaudhuri Equation: 

^ 1 aa() + Wa/3 - Ra0 , (53) 

2. Generalized Jacobi Equation for tidal dynamics: 

+ + = (54) 

where 0 is the expansion, is the shear, and is the vorticity associated with a 

congruence of worldlines in (53), and where the unspecified extra terms in (54) refer 
to higher-order contributions involving the covariant time derivative of the separation 
vector between neighbouring worldlines. 

In both contexts, there lies an important physical challenge to understand, which 
concerns the fact that the spinning particles moving along separate worldlines must 
necessarily interact with each other, which then significantly impacts upon their 
evolution in space-time. To put it another way, it is not reasonable to expect that 
one can just make multiple copies of MPD particles that are distributed throughout 
a given curved space-time background and have their worldlines determined solely 
by the background curvature. This issue can be best appreciated by appealing to the 
analogy of magnetostatics involving the attractive force felt by two parallel currents 
in a steady state, or conversely the repulsive force felt by two anti-parallel currents. 
Therefore, it is necessary to find an avenue that allows for the individual spinning 
particles to have non-trivial interactions. 
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5.2 Locally Induced Einstein-Cartan Interaction 



In order to address the challenge of incorporating the interactions between two spin- 
ning particles that are separately subject to forces and torques as described by the 
MPD equations, it is useful to consider the situation in terms of the overall kinematics 
first without necessarily modifying the equations themselves.^ 

To this end, recall (28) and (29) with parametrization constraint (30) for the four- 
velocity u^{e). Then the corresponding acceleration a^{e) for the spinning particle, 
expanded out in 5, would formally read as 



a^{e) 



_ Du^(e) 
dr 

= 5 Vq; -h j 

-\-e^ U^2) + ^fo)] • 



(55) 



Consider now a situation involving two spinning particles propagating in a curved 
space-time background. Because both particles are subject to forces and torques 
generated by the MPD equations, their worldlines are not geodesics. At the same 
time, because they are independent particles with distinct physical properties to 
them, their responses to the spin-curvature interaction are obviously expected to be 
distinct. 

Now choose one of these two worldlines to act as a reference with respect to the 
other one. For the particle on that reference worldline, its propagation with respect 
to the background space-time can be determined in accordance with Fermi- Walker 
transport [29], such that 



dr 

where 

pM pM j_ 

^ (FW)ap ~ ^ ^ {FW)a(3 ’ 

^{FW)a(5 ~ j ’ 

and F^^^^ is the metric connection for the space-time background. 

At this point, it is important to make the following observation. From the per- 
spective of an observer co-moving with the reference particle, it would appear from 



(56) 

(57) 

(58) 



^ As presented, the MPD equations implicitly assume that all interactions are “local” in their descrip- 
tion, at least within the context of General Relativity. If evidence emerges to show that gravitational 
interactions are necessarily “non-local” by some defining criterion, then it would follow that many- 
body spinning particle interactions as described here must incorporate non-locality. By implication, 
an appropriate modification of the MPD equations would then be well-motivated. 
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(56) that the particle is propagating along a geodesic in a perturbed space-time 
background due to particle acceleration generated by the MPD equations. Further to 
this perspective, it is self-evident from (58) that the Fermi- Walker connection (57) 
is not symmetric in its lower two indices, yielding an antisymmetric contribution. 
Therefore, it follows that is interpreted as the contortion tensor, with an 

induced torsion field generated by the reference particle that gets encoded within a 
new curvature tensor determined by (57). 

The significance of this approach is that, in the instant of proper time as determined 
for the reference particle, the neighbouring particle is now subject to an MPD spin- 
curvature interaction involving this new curvature tensor with induced torsion due 
to the reference particle’s acceleration. Furthermore, the geometry is now an instan- 
taneous Riemann-Cartan [30], with the acceleration-dependent terms giving rise 
to what will now be called a locally induced Einstein- Cartan (EC) interaction. 

Further details about the locally induced EC interaction and its subsequent appli- 
cation for the rest of this paper are deferred to a future publication [28]. However, 
it is worthwhile to show its presence in terms of the metric tensor of space-time in 
local Fermi normal co-ordinates (T, X).^ To linear order in the reference particle’s 
acceleration, with = aj from the induced EC interaction labelled with boldface 
to distinguish these acceleration terms from others that arise elsewhere, it is shown 
that 



with the metric tensor having now both symmetric and antisymmetric components, 
in which the latter set are dependent upon . 

It is very important to point out that, while the induced EC interaction is a purely 
frame-dependent effect akin to a fictitious force, the new curvature tensor based upon 
(57) is perfectly well-defined because (58) is a tensor. For this reason, the explicit 
time-dependent contributions in (59) and especially (62) are labelled with AT, to 
symbolize that these contributions are ephemeral and can only persist within a limited 
time scale. In terms of the physical interpretations that can be ascribed to the EC 
interaction, it can be understood as something analogous to the Lense-Thirring effect 
that causes the precession of a nearby gyroscope due to the spin rotation of a massive 
body. This also fits well with an interpretation of Einstein-Cartan theory in terms of 
defects within space-time in like manner to that of classical defect theory concerning 



^For ease of notation, the indices are left unhatted. 





(60) 



( 61 ) 



(62) 

(63) 
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internal stresses within an elastic medium [31]. From such an interpretation, the 
induced torsion due to the reference particle causes a space-time defect that then has 
to be restored to its original state once the particle leaves that space-time region. 



5.3 Spin Modifications of the Raychaudhuri Equation 

When using the spin-perturbed space-time background to compute the Raychaudhuri 
equation, it takes the form 

1 n0 ^ . p(FW) a „0 

— ~ ^a(3 ^ ^ 

- 2a“ {aoi0 - U)a0) , (64) 



where 



0(FW) ^0 ^ y C 



UJ 



(FW) 

^ a (3 

{FW) 

a(3 



— <^0/3 + (^{a ^/3) — 



^{^Ua) - -Ohap 



+ (^{a ^/3) » 



^a(3 + ^(3] = "^[[3 ^a] + W/3] . 



(65) 

( 66 ) 

(67) 



are the modified forms of the expansion, shear, and vorticity with ha [3 = + UaUf 3 , 

such that = 0 and h^a = 3, while 

d(EW) a /3 _ d( 0) j.P _ ^ 

^a(3 ^ ^ — ^q;/3 ^ ^ 2 ^ 

shows that the modified gravitational term includes the spin-dependent acceleration 
with opposite sign compared to the background Ricci term. This is interesting because 
it shows that the spin-curvature interaction causes a defocussing of the congruences. 
In particular, such a result may have relevance for cosmology as a way to account 
for a small repulsion that is currently attributed to dark energy, assuming that a 
sufficiently large MPD spin-curvature force can be generated due to the coupling of 
galactic spin with the cosmological space-time background. More details need to be 
examined to determine whether this possibility is physically realistic. 



5.4 Generalized Jacobi Equation and Computation 
of Neighbouring Worldlines 

The other approach towards finding a many -body description of spinning particles 
is to use the generalized Jacobi equation as a computational tool for determining a 
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neighbouring worldline with respect to a reference worldline. Since this approach 
[32, 33] specifically computes neighbouring geodesics with respect to a reference 
geodesic in a general space-time background, it can also be applied for the case of 
the perturbed space-time background as described by (57). 

Suppose that a is the parameter identifying a family of worldlines x^(r; a) in 
a given space-time background, with the reference worldline as x^(r; 0) = x^(r). 
Then the neighbouring worldline is related to the reference in terms of a Taylor 
expansion with respect to a = 0, such that 



/, /, dx^(r) 1 d^x^(r) ^ ^ 

x^{t\ (j) = x''(t) H cr + - ^^2 



= x''(r)+C'‘(r)cr 

1 



+ 






cr^ + 0(<7^) , 



(69) 



where 



eir) 



dx^(r) 

da 



is the separation vector between the neighbouring worldlines and 



(70) 



k^(r) = . (71) 

da 

The challenge is to solve for (70) and (71) for determining (69) to the desired order 
of the expansion in a. 

This study is currently a work in progress [28]. Nonetheless, it is still possible to 
outline the basic approach to solve for (70) and (71) necessary to obtain (69). This 
first point is to assume, just as in (10) and (11), that (70) and (71) can be written in 
the form 



oo 





(72) 


7=0 




oo 






(73) 


7=0 





Focussing only on the first-order contribution in (69), the next step is to deter- 
mine the generalized Jacobi equation (54) in order to determine the proper time- 
dependence of This is best accomplished by solving the equation of motion 
for a test particle in a local Fermi frame [34, 35], in which the right-hand side is 
the acceleration of the neighbouring particle, denoted by the subscript “[1]” in what 
follows below. By again describing the background space-time in terms of the Fermi- 
Walker connection (57) and the locally induce EC interaction, it can be shown that 
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the generalized Jacobi equation is 



+ ^(FW)al3j “ ? “ 



dr^ 



+ 2 I (S^i, + Mjy) + 



dr 



‘^(FW)a0^ ? 






fl 

EC 



0 , 



( 74 ) 



where 



^accel. 



(S^i, + Uu) {a’' - a["i]) H 



D{fw)C 



D(FW)a'^ D(fw)C 



-2a" + 



D(FW)d‘^ 

da 



[1] , r>(f w)Qg ^ \ D(fw)C D(fw)^0 D(fw)C 



+ („IU + 52^) 



dr 



dr 



dr 



- 2^ a^) 



dr 









dr dr 

(EW)^ 
dr 



- 8^ [«'. vir »,i - « *' ^ir «^i] ^ («‘“ “®) <”) 



and 



- 



+ »'• «.) 1& V}?^ .«! - (, + A, (a- - 



+ 



dr 



[« «^^1 
(FW) ^ 

dr 



_ a„-2AT£™««'r<™ 



) 



D(fw)C 

dr 



, / D(FW)^a D(FW)aa\ D(fW)C 



( 76 ) 



212 



D. Singh 



with the neighbouring particle’s acceleration already Lie transported for immediate 
evaluation. It is worth recalling that the neighbouring particle’s acceleration is deter- 
mined by use of the MPD equations, but with use of the modified Riemann curvature 
tensor instead of for the background space-time. 

The final step required is to substitute (72) and all other expansions of 5 into 
(74), to then solve for each order of 5. The difficult part of the task is to evaluate the 
zeroth-order contribution for which applies for geodesics, but once that contri- 
bution is determined, all the others can be obtained without difficulty through simple 
integration, at least in principle. It remains to be seen [28] whether there are any 
further technical challenges in determining (69), but it appears that the conceptual 
difficulties are overcome. 



6 Conclusion 

This paper illustrates the foundations and possible applications for the analytic per- 
turbative approach to the MPD equations. It should be apparent that the approach is 
powerful, in that it is both background independent and applicable to any conceivable 
type of motion for the spinning particle. Furthermore, the outline of future work on 
many-body particle dynamics shows its potential to model many-body astrophysi- 
cal systems that can be compared with observations. Nonetheless, it should also be 
noted that the formalism is always subject to refinements. For example, it would 
be useful to expand upon this perturbative approach to incorporate the quadrupole 
moment interaction of these otherwise pointlike particles, which would undoubt- 
edly contribute to non-trivial motion under more extreme physical situations, such 
as orbital dynamics near the event horizon of a black hole. As well, the formalism 
does not take into account backreaction effects due to the gravitational self-force, 
which seems relevant for addressing more sophisticated considerations than outlined 
in this paper. These modifications do not seem very difficult to incorporate within the 
existing formalism. In its present form, however, there is already a lot of possibilities 
for useful applications to follow. 
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Center of Mass, Spin Supplementary 
Conditions, and the Momentum 
of Spinning Particles 



L. Filipe O. Costa and Jose Natario 



Abstract We discuss the problem of defining the center of mass in general relativity 
and the so-called spin supplementary condition. The different spin conditions in the 
literature, their physical significance, and the momentum- velocity relation for each of 
them are analyzed in depth. The reason for the non-parallelism between the velocity 
and the momentum, and the concept of “hidden momentum”, are dissected. It is 
argued that the different solutions allowed by the different spin conditions are equally 
valid descriptions for the motion of a given test body, and their equivalence is shown 
to dipole order in curved spacetime. These different descriptions are compared in 
simple examples. 

1 Introduction 

An old problem in the description of the dynamics of test particles endowed with 
multipole structure is the fact that, even for a free pole-dipole particle (i.e. with a 
momentum vector and a spin 2-form as its only two relevant moments) in fiat 
spacetime, the equations of motion resulting from the conservation laws = 0 do 

not yield a determinate system, since there exist 3 more unknowns than equations. The 
so-called “spin supplementary condition”, = 0, for some unit timelike vector 

first arose as a means of closing the system, by killing off 3 components of . Its 

physical significance remained however obscure, especially in the earlier treatments 
that dealt with point particles [1-4] (see also in this respect [5]). Later treatments, most 
notably the works by Moller [6, 7], dealing with extended bodies, shed some light on 
the interpretation of the spin condition, as it being a choice of representative point in 
the body; more precisely, choosing it as the center of mass (“centroid”) as measured 
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in the rest frame of an observer of 4-velocity — since in relativity, the center of 
mass of a spinning body is an observer-dependent point. Different choices have been 
proposed; the best known ones are the Frenkel-Mathisson-Pirani (FMP) condition 
[8, 16], which chooses the centroid as measured in a frame comoving with it; the 
Corinaldesi-Papapetrou (CP) condition [9], which chooses the centroid measured 
by the observers of zero 3-velocity = 0) in a given coordinate system; and the 
Tulczyjew-Dixon (TD) condition [10, 11], which chooses the centroid measured in 
the zero 3 -momentum frame (u^ oc P^). A more recent condition, proposed in [12, 
13], dubbed herein the “Ohashi-Kyrian-Semerak (OKS) condition” (which, as we 
shall see, seems to be favored in many applications), chooses the centroid measured 
with respect to some parallel-transported along its worldline. The spin condition 
generally remained, however, a not well understood problem (this is true even today), 
not being clear, namely, its status as a choice (the discussion is sometimes put in terms 
of which are the “correct” and the “wrong” conditions for each type of particle, 
see introduction of [14] for a review), the differences arising from the different 
choices, and what it means to consider different solutions corresponding to the same 
physical motion. Also, some aspects of each condition have been poorly understood, 
especially the FMP condition and its famous helical motions [15]. The rules for 
transition between spin conditions, and the quantities that are fixed (for different 
solutions corresponding to the same physical body), were established in [13], where 
the numerical solutions were compared in the Kerr spacetime, and it was shown that, 
within the limit of validity of the pole-dipole approximation, the different solutions 
are contained within a minimal worldtube, formed by all the possible positions of 
the center of mass, which lies inside the convex hull of the body’s worldtube. These 
rules were further discussed in [16], and used to show that the helical motions are 
fully consistent solutions, always contained within the minimal worldtube (and to 
clarify the misunderstanding that led to the contrary claims in the literature). 

The non-parallelism between the momentum and the velocity of a multipole parti- 
cle subject to external fields, and its relation with the spin supplementary condition, 
is another old problem. A significant step towards its understanding was taken in 
[17], where a generalized concept of “hidden momentum” (first discovered in the 
context of classical electrodynamics [18-22]) was introduced in general relativity, 
and applied to the study of the TD and CP conditions (the latter designated therein by 
a different name, the “laboratory frame centroid”). These ideas were further worked 
out, with emphasis on the FMP condition, in recent works by the authors [16, 24]. 

In this paper, we discuss in detail the different spin conditions in general rela- 
tivity, the centroids that they determine, their uniqueness/non-uniqueness, and the 
momentum- velocity relation arising from each of them. The different solutions given 
by the different spin conditions corresponding to the same physical motion are com- 
pared in simple examples, and their differences dissected. Building on the works in 
[13, 16] (where the equivalence was shown for free particles in fiat spacetime), we 
prove the equivalence of the solutions to dipole order in curved spacetime; in par- 
ticular, we clarify the dependence of the spin-curvature force on the spin condition, 
as being precisely what ensures the equivalence, and the connection of that with the 
geodesic deviation equation. 
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1.1 Notation and Conventions 

1. Signature — h ++; the Levi-Civita tensor, and we 

follow the orientation [1230] = 1 (i.e. in flat spacetime ei230 = 1); ^ijk = ^ijkO- 
Riemann tensor: + ■ ■ ■ . 

2. denotes the projector orthogonal to a unit time-like vector 
u^. 

3. The three basic vectors in the description of an extended body. is the momen- 

tum; = dz^ I dr is the tangent vector to the reference worldline z^(r); the 
vector field involved in the spin condition = 0 is generically denoted by 

u^. 

4. “Centroid”, “center of mass”, “CM”: have all the same meaning herein, 
centroid as measured by an observer of 4-velocity u^. 



2 Center of Mass in Relativity and the Significance of the 
Spin Supplementary Condition 

In the multipole scheme an extended body is represented by a set of moments of its 
current density 4-vector (the “electromagnetic skeleton”) and a set of moments 
of the energy momentum tensor called “inertial” or “gravitational” moments 
(forming the so-called [8] “gravitational skeleton”), defined with respect to a refer- 
ence worldline (r) which is taken to be some representative point of the body, and 
whose motion aims to represent the “bulk” motion of the body. The natural choice 
for such point is the body’s center of mass (CM); however, in relativity, the CM of a 
spinning body is observer-dependent. This is illustrated in Fig. 1 . In order to establish 
how the center of mass changes with the observer, we need reasonable definitions 
of momentum, angular momentum, mass and center of mass. In fiat spacetime these 
are all well defined notions; but it is not so in curved spacetime, as they consist of 
integrals which amount to adding tensors defined at different (albeit close, if the 
body is assumed small) points; different generalizations of these notions have been 
proposed (see e.g. [11, 25, 26]). The discussion herein is aimed to be as general as 
possible; for that we use the following definitions that hold reasonable (at least to 
lowest orders) regardless of the particular multipole scheme followed. 

Consider a system of Riemann normal coordinates {x^} (e.g. [25, 27]) centered 
at the point z^ of the reference worldline, associated to the orthonormal frame 
at that point, and take it to be momentarily comoving with some observer O of 
4- velocity (not necessarily tangent to the curve z^(r)); that is, at 
and the triad ej spans the instantaneous local rest space of O. We define the momen- 
tum angular momentum mass m(u) and centroid x^y^iu) of the particle 
with respect to O as the tensors at z^(r) (respectively point) whose components 
(respectively coordinates) in this chart are 
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Fig. 1 A free spinning spherical body in flat spacetime. Observer O, at rest relative to the axis of 
rotation, measures the centroid to coincide with the sphere’s geometrical center (and with the 
rotation axis), is the centroid as measured in the P* =0 frame. Observers O' and 

O" moving (relative to O) with velocities v', v" opposite to the rotation of the body, see points 
on the right side of the body moving faster than those on the left side; hence for these observers 
the right side of the body is more massive, and the centroid they measure is shifted to the right by 
Ax = V X Si,/M. The larger the speed v the larger the shift; when v equals the speed of light, the 
shift takes its maximum value, with the centroid lying in the circle of radius PMoiier = Si,/ M 




= 2 / , 

m(u) = -P^Ua = / , 






m(u) 



( 1 ) 

( 2 ) 

( 3 ) 

(4) 



Here E(z, u) = E(z(r), u) is the spacelike hypersurface generated by all geodesics 
orthogonal to the timelike vector dX the point (in normal coordinates it coincides 

with the spatial hypersurface = 0), JE is the 3-volume element on E(z, w), and 
where is the (future-pointing) unit vector normal to E(z, u) (at 
xhese definitions correspond to the ones given in [25], and have a 
well defined mathematical meaning, which can be written in the manifestly covariant 
form (66) and (67). They also correspond, to a good approximation, to the ones given 
in Dixon’s schemes [11, 26]. This is discussed in detail in Appendix “Momentum 
and Angular Momentum in Curved Spacetime”. Note that although we used normal 
coordinates to perform the integrations above, the end results and are tensors, 

which can now be expressed in any frame. ^ 



^ One could say the same about the point xgj^ (w) , although one must bear in mind when transforming 
its coordinates to the new frame that it will still be the CM as measured by the specific observer 
and not the CM as measured in the new frame. 
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The vector 

^ -S'^^up (5) 



yields the “mass dipole moment” as measured by the observer O (of 4-velocity u^), 
and 






m(u) 



( 6 ) 



can be interpreted as the shift, or the “displacement”, of the centroid x^y^(u) relative 
to the reference worldline z^(r). This is readily seen in the coordinate system {x^}, 
where = 0 and and so from Eq. (2) we have 



= 2 [ = f = m(u)xi^^(u); (7) 

7S(z,w) 7s(z,w) 



note that x^ = 0, since the integration is performed in the geodesic hypersurface 
E(z, u) orthogonal to at z^(r). Hence Ax^ = S^^/m(u) yields the coordinates 
x^m(^) of the center of mass measured by O, in the normal system {x^}. Since 
the latter is constructed from geodesics radiating out of Ax is the vector at 
tangent to the geodesic connecting z^ and Xq^(u), and whose length equals that 
of the geodesic; that is, Xq^(u) is the image by the geodesic exponential map of 
Ax: Xq^(u) = exp^(Ax). In flat spacetime (where vectors are arrows connecting 
two points). Ax reduces to the displacement vector from to xgj^(w); in curved 
spacetime it is still a reasonable notion of center of mass shift, and so (5) is a sensible 
definition of mass dipole moment. In particular, its vanishing for some observer 
means that one is choosing z^ as the center of mass Xq^(u) as measured by that 
observer. That is, the condition 

= 0 , ( 8 ) 

implying, in the system {x“}, = 0 = 0, states that the reference 

worldline is the center of mass as measured by the observer 0(u) (or, equivalently, 
that the mass dipole vanishes for 0(u)). Equation (8), for some timelike vector field 
defined (at least) along z^(r), is known as the “spin supplementary condition”, 
which one needs to impose in order to have a determined system of equations of 
motion, as we shall see in the next section. As we have just seen, one can generically 
interpret it as a choice of center of mass. 

In order to see how the center of mass changes with the observer, let us for 
simplicity consider the case with no electromagnetic field, = 0; in this case, as 
explained in detail in Appendix “The Dependence of the Particle’s Momenta on E”, 
under the assumption that the size of the body is small compared with the scale of 
the curvature, the moments (1) and (2) do not depend on the argument of T>(z, u); 
that is, they depend on the point along the reference worldline z^(r), but not on the 
particular geodesic hypersurface E through it. We may thus regard (r) and (r) 
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as well defined functions on z^(r). We shall also introduce the following relations 
which will be useful throughout this paper. Let and u'^ be the 4- velocities of two 

different observers. We can write (e.g. [28]) 



u')(u^ + v^(u' , u)); 7 (w, u') = 



1 

VI — ' 



( 9 ) 



where u) is a vector orthogonal to whose space components yield the 

ordinary 3 -velocity of the observer in the frame = 0 (i.e. the velocity of the 

observer relative to the observer u^). Choose to be the CM as measured by 
that is, choose = 0. In order to obtain the mass dipole 

measured by one just has to contract with u'^\ (d^)^ = this is 

because, under the assumptions above, does not depend on the normal to the 
hypersurface S(z), and thus, in the = 0 frame, we may write in the 

form (7), only with u' in the place of u. The shift of the centroid measured 

by relative to is thus 






m(u') 



— 7 (w, u') 



S^^Vj 3 (u\ u) 
m(u') 



( 10 ) 



cf. Eq. (6). Especially interesting is the case = P^/M, where we denoted M = 
-y/—P^Pa', this amounts to choosing as the CM as measured in the =0 frame, 
= x^y^(P). In this case 






Si. vp 
M 



( 11 ) 



where = v^(u' , P) is the velocity of the observer u'^ relative to the P^ = 0 frame, 
and we denoted by the angular momentum taken with respect to z^ = -^cm(^) 
(note that the tensor 5''^^ depends on the choice of z^, cf. Eq. (2); for the same body, 
SaP general different for different z^^’s). Let us denote also the corresponding 
spin vector S^, so that P^ IM. The space part (both in the = 0 and 

in the P^ = 0 frames, as Ax^ is orthogonal to both u'^ and P^) reads 



Ax^ 



(i X vY 
M 



( 12 ) 



Thus the set of all shift vectors corresponding to all possible observers spans a disk 
of radius RmoWqt = S^^/M, centered at vgj^(P) and orthogonal to and P^, in the 
tangent space at vgj^(P). This statement can roughly be rephrased as saying that 
the set of all possible positions of the center of mass as measured by the different 
observers is contained (and fills) such disk (in fiat spacetime this is an exact statement, 
originally by Moller [7]). Let us dub such disk the “disk of centroids”, and its radius 
^Moller the Moller radius. 
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In order to illustrate how this works, consider for simplicity the setup in Fig. 1: 
a free spinning spherical body in flat spacetime. Observer O, at rest relative to the 
axis of rotation, clearly must (by symmetry) measure the CM to coincide with the 
body’s geometrical center (and with the rotation axis). The rest frame of such an 
observer corresponds in this case to the =0 frame (this statement will be made 
obvious in Sect. 3.2.3 by Eq. (23)). Consider now other observers. O' and O", moving 
(relative to O) with velocities v' and v", opposite to the rotation of the body; for these 
observers the center of mass is shifted to the right, as they measure the right side 
of the body to be more massive. The larger the speed v the larger the shift; when v 
equals the speed of light, the shift takes its maximum value, with the centroid lying 
in the circle of radius /?Moller- 

In spacetime, the set of all possible centroid worldlines forms a worldtube — the 
“minimal worldtube” [13], see Fig. 2 — typically very narrow,^ and always contained 
within the convex hull of the body’s worldtube (see [25] for its precise definition). 
This can be shown in different ways. In fiat spacetime, it is not difficult to show 
(see e.g. [29, p. 313]), that if the mass density-energy density p(u) = T^^upUa is 
positive everywhere within the body and with respect to all observers (i.e. if the 
weak energy condition holds everywhere within the body), then the center of mass 
with respect to any must be within the body’s convex hull. The fiat spacetime 
arguments apply just as well in a local Lorentz frame {x^} (under the assumption 
above that the body is small enough so that we can take it to be nearly orthonormal 
throughout it). In the same framework one can show that Ruollev is the minimum 
size that a classical particle can have in order to have finite spin without containing 
mass-energy flowing faster than light, that is, without violating the dominant energy 
condition. The dominant energy condition implies p > |/|, where 
Let a be the largest dimension of the body; in the local Lorentz frame centered at 
that = 0, we may write. 






X X < J x|7|JE < J 



pxdT, < Ma ^ a > — . 

M 



(13) 



3 The Momentum- Velocity Relation 



The force and the spin evolution equations for a multipole particle in an external 
electromagnetic and gravitational field are [11] 



^For the fastest spinning celestial body known to date, the pulsar PSR J1748-2446ad (rotation 
frequency 716 Hz, estimated radius a = \6 km), whose equatorial velocity is 0.23c, ^Moiier —0.1a, 
see also the contribution by Giulini in this volume. 



222 



L.F.O. Costa and J. Natario 



D 1 

= qF^Ul^ + + F'l-j- , 

+ (14) 

+ r“^ (15) 

dr 

where and jj.a^ are, respectively, the particle’s charge, electric dipole vector, and 
magnetic dipole 2-form (for their precise definitions, see [24]). (2^^ ^ ) denotes the 

force (gravitational and electromagnetic) due to the quadrupole and higher moments, 
and is sometimes called the “torque” tensor. = dz^ I dr is the tangent to 
the reference worldline z^(r). These equations form an undetermined system even 
in the case DP^/dr = 0 and = 0 (for there would be 13 unknowns: P^, 3 
independent components of and 6 independent components of for only 10 
equations), manifesting the need for a supplementary condition, which amounts to 
specify the worldline z^(r), relative to which the moments are taken. The condition 
= 0, for some unit timelike vector field defined along z^, kills off 3 
components of the angular momentum and makes that choice, requiring, as explained 
in the previous section, z^(r) to be the centroid as measured by an observer of 
4- velocity u^. Contracting (15) with one obtains an expression for the momentum 
of the particle. 



, (16) 

where 7 (f/, w) = —U^Ua, m(u) = —P^Ua, and in the second term we used 
= 0. Equation (16) tells us that, in general, P^ is not parallel to the CM 
4- velocity in this section we will discuss the reason for that. 

The vector P^ can be split in its projections parallel and orthogonal to the CM 
4- velocity U^: 

P- = ^ mU\ Ph“d ^ . (17) 

where m = —P^Ua’^^ the “proper mass”, i.e. the energy of the particle as measured 
in the CM frame, and 

is the projector orthogonal to U^. We dub the parallel projection P^^ = mU^ 
“kinetic momentum” associated with the motion of the center of mass; it is the most 
familiar part of P^, formally similar to the momentum of a monopole particle. The 
component P^^^ orthogonal to is the so-called “hidden momentum” (e.g. [17]). 
The reason for the latter denomination is seen taking the perspective of an observer 
0(U) comoving with the particle: in the frame of 0(U) (i.e. the = 0 frame) 
the 3 -momentum is in general not zero: P = P^id 7^ 0; however, by definition, the 
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particle’s CM is at rest in that frame; hence this momentum must be somehow hidden 
in the particle. consists of two parts of distinct origin: 



pCX — 

^hidl = 

pa ^ 
Miidr — 



1 

7(m, U) 

1 

7(m, U) 






Du^ 

dr 

up, 



(18) 

(19) 



which we shall explain. is a term that depends only on the spin supplementary 
condition, i.e. on the choice of the field relative to which the centroid is computed. 
In this sense we say it is gauge. This type of hidden momentum was first discussed 
in [17] (dubbed “kinematical” therein). The vector needs only to be defined along 
z^(r)\ but if one takes it as belonging to some observer congruence in spacetime 
(one can always do such an extension), and decomposing 

Ua\P — (P )aUp €(^p^§LJ^ U + 0(^p ( 20 ) 



where (a^)^ = u^.pU^ is the acceleration of the observers 

their vorticity, and O^p = (^“)^(^“)^W(A;i/) shear/expansion, we may write 



■ ( 21 ) 

The kinematical quantities in Eq. (20) are connected to “inertial forces”, namely 
= —(a^)^ and = uo^ are, respectively, the “gravitoelectric field” and the 
“Fermi- Walker gravitomagnetic field” as measured by the congruence of observers 
see [23, 28]. For this reason we dub “inertial” hidden momentum. 

Phidr is associated to the “torque” tensor and (in general) consists of two 
parts: one which is again gauge and arises for certain choices of reference worldline 
(i.e. of the field u^) when a physical torque acts on the particle, plus another part 
which is not gauge, and cannot be made to vanish by any center of mass choice. Fol- 
lowing [17] we dub the latter “dynamical hidden momentum”. To dipole order, this 
dynamical part consists of a form of mechanical momentum that arises in electro- 
magnetic systems, first discovered in [18], and since discussed in number of papers, 
e.g. [19-21], including recent works [17, 22, 30]. To quadrupole and higher orders, 
there are both electromagnetic and gravitational contributions to and thus to 

Phidr • 



3.1 Inertial Hidden Momentum^^: Center of Mass Shift 
and the Decoupling of from 

Equation (18) tells us that when varies along z^(r) (i.e. Du^ /dr 7 ^ 0), in general 
^Wdi ^ parallel to P^. This comes as a natural consequence of 
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what we discussed in Sect. 2 about the observer dependence of the center of mass. 
Recall the situation in Fig. 1, difree spinning particle in flat spacetime: the centroid 
measured by observers moving relative to O are shifted relative to xcu(0). If the 
velocity of these observers changes along z^(r), e.g. if at an instant we have 
and at r" the shift changes accordingly, giving rise 

to a non-trivial velocity of the centroid. That is, superfluous centroid motions can 
be generated just by changing along z^(r). The momentum, however, remains 
the same, DP^ /dr = 0, cf. Eq. (14); thus the situation may be cast as the centroid 
acquiring a non-zero velocity in the =0 frame (which is in this case the rest frame 
of the observer O). This amounts to saying that gains a component orthogonal 
to (denote it by f/^); conversely, there is a component of P^ orthogonal to f/^, 
which is the hidden momentum. Let us see this in detail. Denote by and f/^, 
respectively, the components of parallel and orthogonal to 

C/" = t/" + C/“: t/" = , (22) 

where m = —U^Pa, and = P^ P [ 3 /M'^ + denotes the projector in the 

direction orthogonal to P^. is, up to a 7 factor, the 3-velocity of the centroid in 
the P^ =Q frame, cf. Eq. (9) above (substitute therein u'^ = = P^/M). In 

the special case = 0, we have from Eq. (16) 



= ^ (hP)» s^P^ , (23) 

m{u) dr 

showing that indeed the variation of along z^(r) leads to a centroid moving in the 
zero 3-momentum frame, and to a non-parallelism between and P^ (it is actually 
the sole reason for that in the special case = 0 ). If we further specialize to the 
case of a free particle (depicted in Figs. 1 and 2), DP^ jdr = and noting that the 
centroid shift can be written as = — (x^j^(E) — x^y^{u)) = cf. 

Eq. (10), the shift variation along becomes 



Pq ry ^ ry 

_ t- — JJ^ — TJ^ 

dr M'^ dr ^ 



(24) 



In the second equality we used Eq. (15), in the third we used Eq. (22). That is, the 
variation of the shift equals the component of orthogonal to mathematically 
formalizing the heuristic arguments in Figs. 1 and 2b. One should note however that, 
although this reasoning is useful to gain intuition, in the general case (DP^/dr / 0 ) 
Eq. (24) does not hold, and Uf is not just the variation of Ax^; this is because 
the centroid xgj^(E) is in general no longer at rest in the =0 frame. (When 
one employs the TD condition, = P^ /M, if DP^ /dr / 0 then the centroid 
4-velocity is not in general parallel to cf. Eq. (23) or, explicitly, Eq. (28).) For 
the general case the argument can be given as follows: the centroid position depends 
on the field relative to which it is measured, and its velocity on the variation of 
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Fig. 2 a The body’s worldtube {larger cylinder), the worldtube of centroids {narrow inner cylinder), 
and the three basic vectors involved in the description of the motion: the momentum P^, the 
4-velocity = dz^ j dr, and the vector field involved in the spin supplementary condition 
= 0. The vector is orthogonal to the hypersurfaces 'E{t,u) at , and has the interpretation 

of the 4-velocity of the observer measuring the centroid. These three vectors are not parallel in 
general, b A curve with a varying along it; that leads to a varying shift, see Fig. 1, leading to a 
non-zero velocity of the centroid in the = 0 frame, cf. Eq. (23), and possibly to an acceleration 
without any force involved 



P^, however, is unaffected by that, which means that in general ]){ U^. This 
is precisely what Eq. (23) says. 

When ^ 0, then (obviously) P^ has a component P^^^ orthogonal to U^. 
Noting from (22) that P^ = M^{U^ — Uff)/m, and from Eq. (17) that = 
{P^ — P^^^/m, we obtain the following relations between the hidden momentum 
and Uff. 

U1 = ^h“d = (25) 

(these are fully general expressions, valid when 7 ^ 0 ). 

Differentiating (23) with respect to r, we see that when jdr^ 7 ^ 0, in general 
the centroid acceleration a^ = DU^ /dr will be non-zero, i.e. it will accelerate 
without the action of a force. That can lead to exotic motions; an example of that 
are the famous Mathisson helical motions, as shown in [16]; the same principle also 
leads to the bobbings in the “tetherballs” studied in [17] (in this case a force is 
involved, but it is not parallel to the acceleration), or the ones studied in Sect. 3.4. Of 
course, such effects can always be made to vanish by a choosing some parallel 
transported along (r ) ; hence one can say that they are a complicated description for 
the same physics that, in principle, could be described in a simpler manner. In Eig. 2 
we illustrate the situation for a free particle in flat spacetime: the worldline z^(r) of 
a centroid measured by a field of observers that varies along it has, in general. 
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superfluous motions. These are conflned to the worldtube of centroids, which is a 
straight tube (always within the convex hull of the body’s worldtube, see Sect. 2) 
parallel to the constant momentum P", and whose cross section orthogonal to is 
the disk of centroids, orthogonal to illustrated in Fig. 1. Choosing Du^/dr = 0 
(e.g. inertial frames), the centroid worldlines obtained are straight lines parallel to 
yielding the simplest description possible for this problem. 



3.2 Center of Mass and Momentum-Velocity Relation 
of the Different Spin conditions 



In this section we shall consider, for simplicity, the case = 0, so that the only 
hidden momentum present is the inertial hidden momentum Although all forms 
of hidden momentum have some sort of dependence on the spin condition, by the 
circumstance that = dz^ j dr depends on the reference worldline chosen, 

Fj^di the part that arises solely from it. Note that = 0 corresponds for instance 
to the case of pole-dipole particles in purely gravitational systems. 



3.2.1 The Corinaldesi-Papapetrou (CP) Condition 

This spin condition was introduced in [9] for the Schwarzschild spacetime, where it 
was cast, in Schwarzschild coordinates, as 5^^ = 0. One can write it covariantly as 
= t), with corresponding to observers that have zero 3-velocity in such 
coordinates, = 0. These are the so-called ''static observers'% whose 4-velocity 
is parallel to the time Killing vector: - ^^tatic a d/dt. Hence, this condition 

chooses as reference worldline the centroid measured by the static observers. It 
can be generalized taking the static observers of other stationary spacetimes, or, 
as done in [13], to arbitrary metrics taking the congruence of observers with zero 
3 -velocity in the coordinate system chosen (let us dub it the “laboratory” frame). This 
effectively amounts to considering an arbitrary congruence of observers, which will 
be the problem discussed below: take a matter distribution described by the energy- 
momentum tensor and a congruence of observers wj^b arbitrarily 

flx; then And the worldlines z^ obeying the condition 5'^(z) “kbfe) = 0— which 
demands z^ to be the center of mass as measured by the observer located at 

that precise point. At first sight, it does not even seem obvious that such solutions 
exist. For when one considers an observer at a given point x^, the centroid 

with respect to uf^^^{xl) will be at some point , in general not coinciding with 
x^\ and then at the site , the observer uf^^{x 2 ) that lies there is a different one, 
and measures its centroid to be in yet another different point x^, and so on. This is 
illustrated in Fig. 3a. 

We shall now show that the solution indeed always exists, but in general it 
is not unique. Consider the vector held (the mass dipole with respect to the 
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(a) \ \ \ \ V V \ V \ V ,-7 ' (b) 




Fig. 3 a Centroid as measured by different observers of the congruence Colors specify an 
observer and the corresponding centroid. Observer measures the centroid to be at 

^2 = observer uf^^(x 2 ) = at x^ is a different one, thus its centroid will in general 

be at a different point x^. Observer at x^ measures the centroid to be at yet another 

different point, and so on. b For the observers to agree on the centroid position, they must be 
orthogonal to the same totally geodesic hypersurface S . In this case the condition = 0 fixes 

an unique worldline 



which is a function of where 5'^(z, i/iab) is the angular momentum taken about 
and in the geodesic hypersurface orthogonal to at z^. Consider moreover 
the intersection of the convex hull of the body’s worldtube W with some arbitrary 
spacelike hypersurface E, see Fig. 4; and let Rq{z) be the projection of Jq(z) on E. 
At the bounda^ of the region IF Cl E it is clear from the definition of S^^{z, u) 
in Eq. (2) that Rq{z) points inwards (since, by virtue of the weak energy condition, 
Ta/su^u^ > 0 for any time-like vector u^). Given that Jq(z) is a continuous vector 
field (since “fab is an observer congruence), the Brouwer fixed point theorem implies 
that the flow of do must have a fixed point; i.e. Jg = 0 at least one point within 



Fig. 4 The vector field d^iz) (i.e. the mass dipole as measured by the observers uf^^^ at z^), at the 
boundary of the region formed by the intersection of the convex hull W of the body’s worldtube 
with some space-like hypersurface E, always points inwards. Since the field Jq(z) is spacelike 
and continuous, the Brouwer fixed point theorem ensures that Jg(z) = 0 at least one point 
z^ G IF n E. In other words, there is at least one point which is the center of mass as measured by 
the observer at that point 



observer wfabfe)) 



doiz) = -S'^piz, Mlab)Mfb(^)’ 
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W n E. Since is a space-like vector, this effectively means that = 0 at 
that point. 

The argument above is analogous to the one followed by Madore [25] to produce 
a similar proof for the vector field 5'^(z, P)P^{z). 

Hence, at least one worldline will exist such that 5'^(z) Whbfe) = 0; but in 
general it is not unique. The analysis in Sect. 3.2.3 provides an example. Take, in 
fiat spacetime, the congruence to be observers rotating rigidly with the angular 
velocity of Mathisson’s helical motions, ^2 = M/ 5^ (i.e. take the laboratory frame to 
be the observers at rest in the coordinates associated to a frame rotating with angular 
velocity Q), opposite to the sense of rotation of the body, and around the centroid 
measured in the P^ = 0 frame, In this case, every point z^ within the 

worldtube of centroids is a center of mass with respect to the observers at rest in this 
frame, i.e. every such point is a solution of 5'^(z)w(^|3(z) = 0. 

In some cases the solution is unique; it is clearly so when the observers of the 
congruence agree on the centroid position (note however that this is a sufficient, but 
not necessary condition for uniqueness). The moments (thus the centroid, Eq. (4)) are 
defined integrating on geodesic hypersurfaces E(z, w) orthogonal to u^{z) at some 
z^; in order for different observers, e.g. and to agree on the centroid position, 
their hypersurfaces E(zi, t/i) and E(z 2 , U 2 ) should be the same. Since E(zi, t/i) is 
geodesic at zf (i.e., it is constructed from geodesics orthogonal to radiating out 
of the point z)^), and E (z2, U 2 ) is geodesic at Z 2 and that must be true for every other 
point, then this means that the congruence must be orthogonal to a totally 

geodesic hypersurface E. This implies to be vorticity-free, = 0 (so that 

it is hypersurface orthogonal), and rigid, Oap = 0 (so that the second fundamental 
form of E vanishes). Since, from the general decomposition (20), we have, for any 
spatial vector tangent to E(z, wiab), 



VxMgb = + OapX^, (26) 

then this implies that the congruence is parallel^ along E(z, wiab), ^x^£b ~ 
This is the case of static observers in a static spacetime. When these conditions 
hold, and within the regime where the normal coordinates can be taken as nearly 
rectangular throughout the body"^ (that amounts to taking A ^ 1 in Eq. (69), which 
is reasonable in this context, see Appendix “Momentum and Angular Momentum in 



^Such observers are said to be “kinematically comoving” (see [31, Sect. 6.1]). 

^To see the reason for this assumption, consider two observers ^2 ~ 

orthogonal to the same geodesic hypersurface E. Let |v“} and |x“}, respectively, denote 
their normal coordinate systems, related by = A^(x^ “ -^ 2 ) (where A A is a function 

of x^). They will agree on the centroid position if •*Cm(“2) = “ ■* 2 )' 

Eq. (4) we see that it is the case when dx® = dx®, x* = AL(x-^ “ -^ 2 )’ ^^th A' . a constant 

j ■' 

matrix. Due to the curvature, however, this cannot be exactly so; choosing da\x 2 — da\x 2 ^ we have 
A^ = (5^-bC)(||R|| XX 2 ) + O ( 1 1 R 1 1 x| ) , e.g. Eq. 1 1 . 1 2 of [3 2] . It follows that, for all observers within 
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Curved Spacetime” and footnote 13), the observers will agree on the centroid 
position. If one starts with an observer at a point xf, and computes 

the centroid it measures from Eq. (4), the worldline = Xq^(ui) obtained will 

therefore obey the CP condition = 0, since repeating the computation 

in the normal coordinates of the observer uf^^(z) at z^ yields the same result. An 
example is when are the observers associated to a global inertial frame in flat 
spacetime. In this case not only is the same vector everywhere, as one can set 
the Lorentz frames of each observer uf^^(x) in an hyperplane E to be the same up to 
spatial translations; so all observers of this frame will measure the centroid at the 
same point (i.e. there is a well defined, unique centroid associated to such frame). 
This is an exact statement in this case. 

Momentum-velocity relation . — Since is a well defined vector field in the 
region of interest, we may write Du^^jdT = and therefore, from Eq. (21), 

the momentum reads 

/>« = (-7G^ - ; (27) 

here 7 = minus the acceleration of the laboratory 

observers (i.e. the gravitoelectric field), is their vorticity (or the Eermi- Walker 
gravitomagnetic field [23, 28]), and 0a(3 their shear/expansion tensor. Hence we 
have a well defined expression for P" in terms of and the kinematics of the 

congruence telling us that differs from m only if the laboratory observers 
measure inertial forces (i.e. if they are accelerated, rotating, or shearing/expanding). 



3.2.2 The Tulczyjew-Dixon (TD) Condition 

The condition = 0 amounts to choosing = P^/M, i.e. the centroid is the 

one as measured in the zero 3-momentum frame. As shown in [33, 34], for a given 
matter distribution, described by the energy-momentum tensor T^^{x), there is only 
one worldline z^(r) such that P^ = 0 and being both evaluated at 
and using the hypersurface E (z, P) orthogonal to P*^ at z^). In other words, this spin 
condition specifies an unique worldline. It is the central worldline of the worldtube 
of centroids, as seen from Eq. (12). Erom (16) and (17), we have the expressions for 
the momentum 

p<^ = — ( = mU^ + ( 28 ) 

m \ dr / m dr 



(Footnote 4 continued) 

the body’s convex hull, ||xgj^(w 2 ) — x^y^(ui)\\/a < X, X = ||R||a^; hence xgj^(^ 2 ) — vgj^(wi) if 
A 1. This is, as expected, the condition that the metric + C>(||R||x^) can be taken as 

nearly flat throughout the body. 
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Here DP^/dr is the force, given by Eq. (14); in the absence of electromagnetic 
field, and to pole-dipole order, this expression can be manipulated into the well 
known expression (e.g. [14]) 



determining uniquely in terms ^ of and Ra/3j6- A more general expression 

for the case when / 0, and to arbitrary multipole order, is given in Eq. (35) 
of [17]. 

3.2.3 The Frenkel-Mathisson-Pirani (FMP) Condition. Helical Motions 

The condition = 0, i.e. u^ = states that the centroid is measured in 

its own rest frame; in other words, it chooses the center of mass as measured by an 
observer comoving with it. This condition does not yield an unique worldline though: 
it is infinitely degenerate. For a given matter distribution, described by T^^(v), there 
are infinitely many worldlines through it such that = 0. Indeed, any 

point within the disk of centroids can be a solution (i.e. can be a center of mass as 
measured in its proper frame) provided that it moves with the appropriate velocity. 
In order to see that, we start by an heuristic argument, originally due to Moller 
[7]: consider, in Fig. 1, a point in circular motion opposite to the rotation of the 
body, with a radius R = v' Si, /M such that it passes through the centroid Xq^(u') 
measured by the observer O', and having therein the same velocity as O' . Such point 
instantaneously coincides with x^y^(u'), and at the same time is at rest with respect 
to 0'\ it is thus a center of mass computed in its own rest frame, and will be so at 
every instant as the motion is circular. The angular velocity of such points is constant, 
Q, = v' /R = M/Si, (i.e. does not depend on R). That is, consider a disk of the same 
size of the disk of centroids, rigidly rotating about the centroid v^j^(P) measured 
by O; any point of such disk is a centroid computed in its rest frame, and is thus a 
solution of S^^Up = 0. This is the origin of the helical motions (in a frame moving 
with respect to O, the circular motions become helices). 

These facts can be explicitly checked from the equations of motion. First we note 
that, with this spin condition, the momentum becomes, cf. Eq. (16), 




(29) 



P" = mt/" + = mt/" + 



(30) 



where 5“ is the spin vector defined by 




(31) 



^The factor mlM^ (involving via m) can be determined by the normalization condition 
U^Ua = - 1 . 
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Noting from (30) that = P^Sa = 0, and using = 0, the component 

of the 4-velocity orthogonal to P^ is, from Eq. (23), 

Ul = ( 32 ) 

which in the =0 frame reads 

1 DU - 

U X S = 0. (33) 

M dr 



This is a differential equation for the space components U ; as discussed above, 
V = U/j(P,U) has the interpretation of 3 -velocity of the centroid in the = 0 
frame. Take now for simplicity the case of a free particle in flat spacetime; in this case, 
fromEq. (14) wehave = 0;also,fromEq. (35), it follows that/) 5'^/Jr = 0 

(since S^Ga = 0, which can be seen substituting (30) in DP^ /dr = 0); thus M and 
S in (33) are constants, and the solution for the reference worldline (U = dzidr) 
is, in rectangular coordinates (taking S along e^), 

j“(r) = {^T, -R cos (y r) , R sin (y r) , o) (34) 



where R = vjS/M, and 7 = j(P, U) = —P^Ua/M = Vl — v can take 
any value between 0 and 1 . These are circular motions of radius R and frequency 
Q = M/jS, centered about the centroid vgj^(P) measured in the =0 frame. 
They may not seem at first the same motions we deduced from the heuristic argument 
above; in particular, the fact 7 can be arbitrarily large has led some authors to believe 
that the radius of these motions, for a given body, can be arbitrary, and for this reason 
deemed them unphysical [35-37]. That is not the case; the reason for that is that S is 
different for all the helical representations corresponding to the same body. Let z^ and 

z'^ denote two different helical solutions. The scalar S = ^S^Sa = yj S^f^Sap/2, 

for a spin tensor obeying = 0, is the magnitude of the angular momentum 

taken about z^ = x^y^(U). It should in be different, /<9r the same matter distribution 

from S' = ^S'^^S'^p/2, since S'^^, obeying S'^^U'p = 0, is the angular 
momentum about a different point, z"^ = Xq^(U'). It is shown in Sect. IV of [16] 
that, for all helical motions, S = 5'^/7, where = yfs^^^^/2 is the magnitude 

of the angular momentum taken about ^cm(^) (i-e- = 0)- So indeed these 

motions have a finite radius and constant frequency, as deduced above: 
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Hence we see that the famous helical motions are just another exotic effect gen- 
erated by the variation, along z^(r), of the field of observers (= in this case) 

with respect to which the centroid is computed; what is special about them is that 
in this case the non-trivial motion induced on the centroid is such that the latter is 
always at rest with respect to the observer measuring it. Thus they are not unphysical, 
contrary to some claims in the literature; but they do not contain new physics either, 
they are just alternative, unnecessarily complicated descriptions for physical motions 
that can be described through simpler representations: for example, the non-helical 
solution that this spin condition also allows, which in the case of a free particle in 
fiat spacetime is uniform straight line motion (corresponding to i; = 0, 7? = 0, in 
Eq.(34)). 

It is also worth noting that, from a dynamical perspective, the consistency of 
the helical motions (namely, the fact the centroid accelerates without any force) is 
explained through an interchange between kinetic momentum and 

hidden momentum which occurs in a way that their variations cancel 

out at every instant, such that = mU^ -h remains constant; see Fig. 3 of 

[16]. This is exactly the same principle behind the bobbings due to discussed 
in Sect. 3.4. 



Features of the FMP Condition: Fermi-Walker Transport 
and Gravito-Electromagnetic Analogies 

If one employs the Frenkel-Mathisson-Pirani condition, the spin vector of a gyro- 
scope (if = 0) is Fermi-Walker transported along the worldlines of any of the 
centroids obeying this condition. This can easily be seen substituting Eq. (31) in (15) 
to obtain 

DS^ 

—— = S^a^U^ . (35) 

dr 

This is the most natural description for the spin evolution, where the mathemat- 
ical definition of a locally non-rotating frame meets the physical one: gyroscopes 
“oppose” changes in direction of their rotation axes; the axis of torque-free gyro- 
scopes define physically the non-rotating frames. On the other hand, Fermi-Walker 
transport is the mathematical definition of a non-rotating frame adapted to an 
arbitrarily accelerated observer: Vxje^ = that is, it admits 

“rotation” (actually boost) in the time-space plane formed by and a^, unavoid- 
able to keep the time axis of the tetrad parallel to the 4- velocity (U = eg), so that the 
triad spans the observer’s local rest space; but no additional spatial rotation (i.e. 
the axes orthogonal to a^ are parallel transported). 

Another interesting feature of this spin condition is that is gives rise to three exact 
gravito-electromagnetic analogies [23, 24]: (i) the spin-curvature force (penultimate 
term of Eq. (14)) becomes Fq = where , analogous 

to the force on a magnetic dipole (second term of Eq. (14)), where 
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Baf 3 = (ii) Eq. (35) becomes, in an orthonormal frame “adapted” to a 

congruence of observers, dS^ ! dr = {S x Hf 12, where H is the “gravitomagnetic 
field”, analogous to the precession of a magnetic dipole, DS jdr = fiyiB (first term of 
Eq. (62)); (iii) the inertial hidden momentum, cf. Eq. (30), is 
with = —a^ the “gravitoelectric” field as measured in the centroid frame, for- 
mally analogous to the electromagnetic hidden momentum, Eq. (65). These analogies 
(apart from their theoretical interest) provide useful insight to study some problems; 
they are discussed in detail in 124] . 

The downside of this condition is the fact that it is not always easy to set up the 
non-helical solution. It is done through suitable ansatzs in Sect. 3.4 (at an approximate 
level), or at an exact level, in very special systems, in 124] (therein it is seen to be 
a good choice, as it takes advantage of the symmetries of the problems to yield the 
simplest equations). Prescriptions in the case of Schwarzschild and Kerr spacetimes 
are also proposed in 138, 39]; however no general rule is known. 



3.2.4 The Ohashi-Kyrian-Semerak (OKS) Spin Condition 

This condition, introduced in 112], and first discussed in depth in 113], amounts to 
choosing a vector field parallel transported along Du^ldr = 0, which 

causes the inertial hidden momentum and its associated gauge motions to van- 
ish, cf. Eq. (18). In the general case where the torque tensor is non-zero, as we 
shall see in Sect. 3.5.1, some superfluous motions may still be present though, due to 
the pure gauge part of the hidden momentum related to (in gravitational sys- 
tems, is usually less important, as it involves the particle’s quadrupole moment). 

When = 0 (the problem at hand herein), it yields the simplest momentum velo- 
city relation possible, = mU^, and a centroid that accelerates only when there 
is a force, ma^ = (this becomes ma^ = F^ for pole-dipole particles in 

gravitational fields, since m = M = ^/—P^Pa is constant, as readily seen contract- 
ing Eqs.(14) or (40) with P^). Equation (15) also takes a simple form, yielding a 
spin tensor parallel transported along z^(r), /dr = 0. 

This condition obviously does not specify an unique worldline through the body; 
it is infinitely degenerate, because there are infinite possible choices of (the only 
restriction imposed is Du^ /dr = 0); but another of its advantages 113] is that one 
does not need^ to explicitly determine to solve the equations of motion (for 
pole-dipole particles), only its value at the initial point is needed. These properties 
together make this condition the most suitable (at least in that case) for numerical 
implementation . 



^We thank O. Semerak and A. Harte for discussions on these issues. 
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3.2.5 Uniqueness of the Centroid Versus Determinacy of the Equations 

There are some apparent contradictions in the literature regarding the uniqueness of 
the worldline specified by the different spin conditions, and what that means in terms 
of the determinacy of the equations of motion. On the one hand most authors (e.g. [7, 
10, 13, 17, 26] ) argue, in agreement with the discussion above, that the FMP condition 
does not uniquely specify a worldline through the body; on the other hand, it has 
recently been argued [40, 4 1 ] that it uniquely specifies the motion, given certain initial 
conditions. Also, in [10, 13], it is said that the CP condition yields an unique solution, 
whereas in the analysis above we have seen that, depending on the coordinate system 
chosen, it may or may not yield an unique center of mass. Our considerations above 
are based on starting with a test body whose matter distribution is described by an 
energy-momentum tensor and asking the following question: given 

does the condition = 0 yield an unique worldline? As we have seen, from 

the four conditions studied above, the answer is affirmative, as a general statement, 
only for the TD condition. 

But if one takes the perspective of the initial value problem for the equations of 
motion (14) and (15), the impact of the uniqueness/non-uniqueness of the center of 
mass definition is not straightforward. First of all one should notice that, without 
further assumptions, the system (14) and (15), supplemented by (8), can be deter- 
mined only to dipole order and if = 0 (otherwise one needs evolution laws for 
^a(3^ higher order electromagnetic and gravitational moments). In this 

case, all the conditions yield a well defined solution if sufficient initial conditions 
are provided; and it is the type of initial data needed to determine the equations that 
depends on the nature of center of mass definition given by each of the conditions. 

On general grounds one can say that if the equations of motion can be written as 
the explicit functions (dot denotes ordinary derivative along U) 

z"(r) = t/“ = t/"(z, P, S'""); P°‘ = /"(z, P, 5"^ = ^"^(z, P, 

then, given the initial values {z“, the system is determined. The first 

equation is the explicit velocity-momentum relation; but the other two also require 
such a relation, as can be seen by writing explicitly 

Thus, in order to have and as explicit functions of (z, P, 5^^), we need to 
have an explicit relation U^(z,F, S^^). 

In the case of the OKS condition, since one has simply = P^jm = P^/M, 
cf. Sect. 3.2.4, the statements above obviously hold, and the solution is determined 
given {z^, P^, or, equivalently, {z^, ^}|in- 

The situation is similar for the TD condition (only with a more complicated 
velocity-momentum relation). Equation (29) is an explicit relation P, S^^)\ 

thus, given P^, the solution is determined. The initial data 
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{z“, 5“^, t/“, m} lin is equally sufficient because one can extract |in from Eq. (28) 
(substituting therein DP^/dr by the explicit expression — an equa- 

tion for is obtained by squaring (28)). 

The case of the CP condition is also essentially similar. One obtains an explicit 
relation U^(z, P, 5^^) as follows.^ Substitute decomposition (20) into Eq. (16), with 
to obtain 

pa ^ ~ ^ , ( 36 ) 

where 7 = This is an equation for P^ in terms of and the 

quantities and 0a(3 which are given in advance (see Sect. 3.2.1 and the 

equivalent Eq. (27)). We need now to solve for U^. Expressing (36) in its components 
in a frame where = 0, we obtain 

= P‘ + S‘jGJ , A\ ^ [P% - S) ' - 7)] , 

where I is the centroid velocity in the = 0 frame. This is a system 

of linear equations for the three components with solution -h 

] . The component (and subsequently, ) is then obtained from the normal- 
ization condition —1 = U^Ua = — v^). We thus end up with an explicit 

relation U^{z, P, 5^^), meaning that, given the initial values {z^, , P^}|in, the 

solution is determined, as asserted in [13]. The set {z^, , U^, ^}|in is also suffi- 

cient, in agreement with the claim in [10], because one immediately obtains P'^lin 
from (27). Einally, note that this is a distinct problem from the one addressed in 
Sect. 3.2.1 (where we started just with a matter distribution T^^(x) and imposed 
which case, as we have seen, the solution always exists but in general 

is not unique). Herein one assumes the existence of some T^^(x) that is compatible 
with the initial conditions prescribed (conversely, in the prescription of Sect. 3.2.1 
there is no longer freedom to choose an arbitrary initial position z^). 

The case of the EMP condition has some important differences. The momentum- 
velocity relation is (30); the acceleration can be written as (cf. Eq. (24) of [13]) 

a"(z, U, S>^^) = 2 ( 37 ) 

with defined by (31), S'^ = ^JS^Sa, and = DP^/dr. Substituting in (30) one 
obtains an explicit relation P"(z, U, 5'^^). However, one cannot a priori guarantee 
that such relation can be inverted into a relation U^(z, P, (such problem, in the 
general case, has not yet been tackled in the literature, to the authors’ knowledge). 
In the special case of a free particle in flat spacetime, we have, from (30), 



^We thank O. Semerak for his input on this issue. 
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a“(P, ; t/“(P, 5^ = ^ ■ (38) 

This is an explicit relation U^(F, (m can be determined through the condition 
U^Ua = —1); therefore, in agreement with the claims in [40, 41], the motion 
is indeed determined given the initial data [z^ , , ^^}|in (i-e. this set of data 

specifies one particular solution of the degenerate condition = 0). On the 

other hand (unlike the situation for the other three spin conditions), the set of data 
{z“, C/“, m} lin is not enough; one needs, additionally, the initial acceleration 

in agreement with the claims in e.g. [11, 17] . This is clear from Eqs. (30) and (38): the 
set of initial data is equivalent to {5^^, These features 

are readily understood in the framework of the discussion in Sect. 3.2.3: as we have 
seen, the motion of an helical solution = x^y^iU) is a superposition of a circular 
motion centered at the centroid measured in the =0 frame, vgj^(P), of radius 
R = II Ax 1 1 and angular velocity = —M 5^/ 5'^, combined with a boost of 4- velocity 
P^/M. Ax^ = z^ — relative to the center of the helix. 

Given S^, and one obtains x^M(P)|in = — Ax-^ from the expression 

Ax^ = Pp/ M'^ , cf. Eq. (10); follows using + IP^^Ax^^, and 

therefore the motion is completely determined. On the other hand, if instead of |in 
one is given { , m} |in, one cannot determine Ax^ ; that is the reason why one needs 
the acceleration, as it contains precisely the same information: a^ = — Ax^M^/5^, 
cf.Eq. (38). 



33 The Dependence of the Spin-Curvature Force on the Spin 
Condition; Equivalence of the Spin Conditions 



We have seen that the significance of the spin condition ^ = 0 is that of a choice 

of representative worldline in the body, more precisely requiring such worldline to 

be, at each event, the center of mass as measured by an observer of 4-velocity u^. We 
have thereby implied that the different spin conditions yield different, but equivalent 
descriptions of the motion of a given body, all contained within the worldtube of cen- 
troids, which in turn lies within the convex hull of the body’s worldtube. That is easy 
to see for a free particle in fiat spacetime, where indeed the different solutions stay 
close forever and within the straight worldtube depicted in Eig. 2. However, when 
external non-homogenous fields are present, changing z^ means not only changing 
the point where the fields (i.e. and Rap-fd) are evaluated, but also changing the 
moments jj^ap, d^, and the 2^^^ moments) themselves, on which the forces 
and torques also depend. These two changes would in principle compensate each 
other; the larger part of the compensation comes from the lower order terms, and a 
smaller part (negligible to some extent) from the higher order terms. Hence, in an 
approximation where only moments up to 2^th order are kept, the different world- 
lines will eventually diverge. However, this does not mean that the spin condition is 
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not a gauge choice after all; in fact, it just marks the limit of validity of the given 
approximation [13]. The subtlety involved in this compensation is that, except for 
the case of flat spacetime, it does not mean that the force is the same for different 
choices of 

In order to see this, let us consider first, in Newtonian mechanics, the problem of 
describing an extended body through different reference points; for more details on 
this problem, we refer to Sect. 3 of [42] . Consider a spherical body in a gravitational 
field G(x). If one takes to be the body’s center of mass = -^cm = / — 

a unique point, in Newtonian mechanics — then, with respect to z^ , the body is effec- 
tively a moncyole, and the only force present is the usual (monopole) gravitational 
force F = Fg = mG(z). Now take a different reference point f = z^ + 

(not a centroid) say, at the boundary of the sphere. The monopole force changes 
to Fg = mG(z -h Ax); but, on the other hand, the particle has a mass dipole mo- 
ment do = f pxd^x = —mAx about z^ (as well as quadrupole and higher order 
moments). The dipole force is = VjG^dQ, hence to dipole order, we have the 
same net Newtonian force: 

F^ = F^gF = mG\z + Ax) - mVjG^z F Ax)Av^‘ mG^’(z) = F\ (39) 

In General Relativity the situation is different because the lowest order gravitational 
force is the (dipole order) spin-curvature force 

, (40) 

cf. Eq. (14), which depends explicitly on the spin condition = 0, i.e. on the 

choice of the centroid Such dependence is not compensated by a change in the 
monopole force (which does not exist), nor by the higher order terms (if that was the 
case, the pole-dipole approximation would not even make sense, for, as we shall see in 
Sect. 3.4.2, the differences in the force under different spin conditions are of the same 
order of magnitude as the force itself). Hence, the net force F^ = is different 

for different z^’s, which is natural in a curved spacetime, since the differentiation is 
along different curves. On the other hand, although the monopole force Fg (or G) has 
no physical existence in the relativistic theory, there is a counterpart to the tidal forces 
arising from the variation of these fields from point to point, '^jGi, which comes 
from the curvature tensor (see below). And the crucial point here is that the change 
in the force F^ when one changes z^ is precisely the one needed to compensate for 
the tidal forces which “try” to make the worldlines diverge. 

This can be formalized as follows. Take two different centroids with worldlines 
z^ and z^, defined by = 0 and = 0, respectively. is the angular 

momentum about z^ and the angular momentum about z^. Extend (in a region 

small enough so that they do not intersect) these worldlines to a congruence of 
curves encompassing both z^ and z ^ ; take them to be infinitesimally close, so that 
one can employ the usual first order deviation equations (Eq. (42) below), and write 
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a connecting vector as = z^(r) — z^(r). Take moreover to be parallel 
transported along (i.e. it obeys Ohashi-Kyrian-Semerak spin condition), so that 
^Mdi = 0 ^ = MU^, and let the field be arbitrary. Noting that can be 

taken as the same for z^ and z^ (see Appendix “The Dependence of the Particle’s 
Momenta on E”), it follows from Eq. (17) that P^ = mU^ + where m = 
— P^Ua\ contracting with P^ to obtain an expression for m/M, and using Eq. (22), 
one obtains 



pa. pa 

— + [/? = — + [terms in [/?] . (41) 

and Pj2d = ^hidi reciprocal quantities; one can write one in terms of 

the other using Eq. (25). ^ P^ jM only if 7^ 0 (or equivalently if 7^ 0). 

Erom the deviation equation for accelerated worldlines [43], we have 

+ VaxQ" = + (VuLf" - VuC/") 

E“^Ax /3 + + [terms in Uf] , (42) 

where is the “gravitoelectric” tidal tensor, which is the rela- 

tivistic counterpart of the Newtonian tidal tensor G/ . In the third equality we used 
Eq. (41) and the following: M is a conserved quantity for the OKS spin condition 
(VxjM = 0 along z^ if P^ || as readily seen contracting (40) with P"), so that 
= DP^/dr = Ma^\ and that along z^ one has = 0 + [terms in G^]. 
Since Ax^ is infinitesimal, we can write (as in fiat spacetime), 

5"^ = 5"^ + Ax^' (43) 



Zl^Ax" 

dr^ 



t/“ 



pa 

Id ^ 



pa p 

1 , ^hid^Q^ 1 

^ m 2 



and therefore the difference between the forces is 

pa _ ^575 _ 575^ + [terms in U^] 

= ME“'^Ax^ + [tenns in Ul] ( 44 ) 

where the terms in are of order O(S^). Substituting in (42), we obtain 

D^Ax^ 

— = 0 -h [terms in U^] . 

That is, the worldline deviation of the two solutions reduces to terms involving 
(i.e. Pj{-^j), that we have seen in Sect. 3.1 to be gauge (arising just from the 
choice of observers relative to which the centroids are computed). This is illustrated 
in Pig. 5. In particular, if one takes two different solutions of the OKS condition 
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Fig. 5 a Two different centroids of the OKS condition move nearly parallel (DAx‘^ /dr = 0). In a 
curved spacetime, that implies that the force DP^/dr along the two worldlines must be different 
(e.g. if z^(r) is a geodesic, z^(f) cannot be), b Centroids z^(f) of other spin conditions accelerate 
relative to OKS centroids due to the gauge motions induced by the variation of along z^ (inertial 
hidden momentum) 



(so that no superfluous motions come into play^) we have simply D'^Ax^/dr^ = 0, 
i.e. there is no relative acceleration between the worldlines, which is guaranteed by 
the difference between the forces and 

The situation becomes especially enlightening (and the correspondence with the 
Newtonian theory closer) in the limit of weak static flelds and slow motion of 
Sect. 3.4. In this case the coordinate acceleration (for a stationary held) of the centroid 
is 



dr^ 



= mG‘(z) + f‘ 



DR 



hid 



dr 



where G is the Newtonian held (more precisely, a flctitious, or “inertial” held, that 
mimics Newton’s G in the coordinate acceleration. Is also known as the “gravito- 
electric” held, e.g. [23]). The coordinate acceleration of the centroid is 



_ T\pi r\pi 

m — - =mG‘(z) + F” ^ = niG‘ (z) + F‘ +mE‘JAxj 

dr^ dr dr 

in the second equality we used (44) neglecting the terms therein (as they are 
of order To first order in Ax, G^(z) ~ G^(z) + G^ Axj \ and since, for 

a stationary held, to linear order (see [23]), Eij = —VjGi, then mG\i) -\- F^ = 
mG^ (z) + F ^ , i.e. the sum of the spin curvature and the Newtonian forces is the same 
for both worldlines, the change in one compensating for the other, just like the case 
with the monopole and dipole forces in the Newtonian problem above, cf. Eq. (39). 
We have thus 



= mG'(z) + F' - 
dr^ dr 



= m 



d^z‘ 

dr 



+ 



dr 



dr 



(45) 



^Only when Vui^ “ = = 0 should one expect two different centroids of the same body to 

move parallel, even in flat spacetime, as explained in Sect. 3.1 (see also Fig. 2b). Otherwise (i.e. 
when 7 ^ 0) they can have an arbitrary relative motion, cf. Fig. 5. 



240 



L.F.O. Costa and J. Natario 



Hence, barring hidden momentum terms, the coordinate acceleration for z^(f) is the 
same as for z^(r). This means, in particular, that the different solutions of the OKS 
condition are trajectories that run parallel (as both the coordinate acceleration and 
the velocity are the same for all of them). 



3.4 Comparison of the Spin Conditions in Simple Examples 

In this section we consider the two simple setups illustrated in Fig. 6 — a spinning 
charged body (with /I = 0, and whose only non- vanishing electromagnetic moment 
is so that = 0) orbiting a Coulomb charge in flat spacetime, and a spinning 
body orbiting a Schwarzschild black hole, both particles having spin S lying in the 
orbital plane — and compare the description of the motion given by the different 
spin conditions. Such comparison will be done ensuring that one is dealing with 
the worldlines of different centroids corresponding to the same physical body (i.e. 
the same matter distribution T^^{x)). We will be using the weak field slow motion 
approximation, for two reasons: first, because it is sufficient to illustrate the effects 
of interest; second, and more importantly, to make clear that the choice of spin 
condition (and the resulting hidden momentum) impacts the equations of motion at 
leading order, and thus these effects must be taken into account in any linearized 
theory or Post-Newtonian approximation. 



3.4.1 Electromagnetic System 

The Corinaldesi-Papapetrou (CP) condition, which sets the reference worldline 
as being center of mass vgj^(wiab) measured in the “laboratory” frame (chosen as the 
congruence of static observers dX rest with respect to the source), coincides in 
this case with one of the solutions of the Ohashi-Kyrian-Semerak (OKS) condition, 
because such frame is inertial, and therefore = 0 = 0, cf. Eq. (18). 

The momentum is thus parallel to the 4- velocity = mU^, and the equation of 

motion for the centroid reduces to ( 2 = charge of the source) 

ma°‘ = F°‘ =qF°‘l^U 3 -, F = qE{U) = qQ'^ + 0{v^) , (46) 

whose well known solution for a Coulomb field is an ellipse. In particular, a particle 
with an initial velocity in the plane, and equaling that of a circular orbit, will 
follow a circular orbit in that plane (regardless of its spin); and a particle with initial 
radial velocity will move radially, cf. Fig. 6a. To compare with the description given 
by other centroids (corresponding to other OKS solutions, or to other spin conditions), 
we note that (i) these have worldlines z^if) related to z^ir) by (see Eq. (10)) 
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Fig. 6 Comparison of different spin conditions (S^^up = 0) in two analogous physical systems: 
a-b A spinning charged particle (but with Jl = 0) orbiting a Coulomb charge in flat spacetime-, 
c-d a spinning particle in the Schwarzschild spacetime. The CP condition, — ^£b’ chooses the 
centroid as measured by the observers at rest in the background (the “laboratory” frame); the FMP 
condition, = U^, and the TD condition, /M, choose the centroid as measured in the 

frame comoving, or nearly comoving (respectively) with it. We consider only the non-helical FMP 
solution. In the electromagnetic system, the CP condition yields Du^ j dr = 0 P^ = mU^ 
(since the laboratory frame is inertial), thus there is no hidden momentum nor exotic motions; 
trajectories are ellipses and, for particles with initial radialjvelocity, straight lines, cf. Fig. 6a. For 
the TD/FMP conditions. Fig. 6b, Du^ /dr ^ 0 since a force F acts on the particle, leading to a hidden 
momentum Phid — x F /m that modifies the trajectories. A bobbing is added to the elliptical 
trajectories due to the oscillation of Phid = along the orbit; and instead of a radial motion, the 
centroid defiects. In the gravitational system the situation is Du^ / dr ^ Q ^ Pj^^ ~ Ofor 

TD/FMP conditions, and it is for the CP condition (since the laboratory observers are accelerated) 
that there is a hidden momentum Phid — S y. G ^ 0. The TD/FMP centroids with initial radial 
velocity move radially, whereas the corresponding CP centroid defiects. Phid also induces a bobbing 
in nearly elliptical orbits (adding to the existing bobbing caused by the spin-curvature force, which 
is not gauge, but has the same form up to a factor of three). In both systems Pj^^ and its induced 
motions are gauged away by the Ohashi-Kyrian-Semerak (OKS) condition; different OKS centroids 
move nearly parallel to each other. 
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= z“ + Ax“ ; A3f ~ , (47) 

m 

where v is the particle’s velocity with respect to the laboratory observers (i.e. 

= 7 ([/, + i;^), cf. Eq. (9)); (ii) the particle’s momentum is, to 

a good approximation, the same for all spin conditions, cf. Appendix “The Case with 
Electromagnetic Eield”; and (iii) regardless of the reference worldline chosen, to the 
accuracy at hand, the net force on the body is the same, F = F (unlike one might 
expect, since the fields are evaluated at different points). Point (iii) is explained by 
arguments analogous to the ones given in Sect. 3.3 for the Newtonian problem: when 
one changes the particle’s moments change as well; if the particle was a monopole 
with respect to z^, then about z^ it will have an electric dipole moment d = —qAx, 
as well as higher order moments. Whereas F is just the Coulomb force F = qE{z), 
to dipole order F = qE(z) F^-p, where F^-p = V jE^d^ + 0{v^) is the force due 
to the electric dipole, cf. third term of Eq. (14^ Hence 

F^ =qE\z)FF^^^^ = qE^{zFAx)-qVjE\zFAx)Axj qE\z) = F\ (48) 

Therefore, any difference in the acceleration of the two centroids is due solely to the 
hidden momentum 



ma = F = F 

dr dr 



^ DP hid 

= ma 

dr 



(49) 



This tells us that different solutions of the OKS condition, for which = 0 
(corresponding to the centroids as measured by observers moving with constant 
velocity with respect to wg^), yield different worldlines all (nearly) parallel to the CP 
solution, since they have the same acceleration, and the same 4-velocity || P^. In 
particular, other OKS centroids z^ corresponding the same physical motion whose 
description through is radial motion, are non-radial straight lines parallel to z^; 
and the z°^’s for which z^ is a circular motion are (non-concentric, in general non- 
complanar) circles, obtained from the latter by a constant spatial displacement Ax; 
see Pig. 6a. 

The situation is different if one chooses the Prenkel-Mathisson-Pirani (PMP), 
or the Tulczyjew-Dixon (TD) condition, /M, which pick as rep- 

resentative point the centroid as measured in the frame comoving, or nearly comov- 
ing, respectively, with it. Since a force acts on the particle (implying DP^ /dr ^ 0 
and d^ ^ 0), it follows that Du^ /dr / 0 and / 0, and therefore P^ is not is 
not parallel to cf. Eqs. (28) and (30). Prom Eq. (28) (and noting from (29) that 
M = m + 0(5'^)), we have for the TD condition 
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This corresponds also to the momentum of the non-helical FMP solution. In order 
to see that, first take (50) as an ansatz, and observe it obeys, to the accuracy at hand, 
the FMP equations of motion. Namely, substituting (50) in the explicit equation for 
the acceleration^ (37), one gets 



(in the second equality we noted that since Sr^ — and F^Ua = 

0, it follows that = projection of F^ orthogonal to S^); 

then, substituting (5 1) in (30), leads consistently to (50). Equation (51) states that the 
acceleration comes, in first approximation (i.e. to zeroth order in S), from the force 
F^, which is what one expects for a non-helical solution (and the rationale for taking 
(50) as an ansatz for (30)). Note from Eq. (34) that, for a free particle in fiat spacetime, 
the acceleration of the helical motions is a = j'^vMIS = 0(S~^) / 0(S). The 
helices are effectively precluded from moment we impose 2:^ mU^ F S^^Fp jm. 

We can therefore write for the inertial hidden momentum of both the TD or (non- 
helical) FMP conditions ^ 

/’hidi - (52) 

This hidden momentum leads to exotic motions of the centroid. From Eq. (49), 



Here r = xex + ycy, since F(= F) is the Coulomb force in the xqJ plane. 

Nearly circular motion . — Let us start by the motion above whose description 
through (i.e. through the laboratory frame centroid, given by the CP/OKS con- 
ditions) was a circular motion in the xoy plane. Since we assume that S lies on the 
xoy plane, it follows that Phidi = fliidi^z, 

= mv^ = P^ = mv^ = mv^\ P^ = mv^ -h Phidi 

(where we noted that since P^ is the same regardless of the spin condition, the 
components of the centroid velocity in the plane are the same as for the CP 
condition: = v^, = v^). Therefore, i; • r = 0, md^ = ma^ = P^, md^ = 

ma^ — P^, and 




FpS^^ + 0{S) = — + 0{S) (51) 

/ m 



ma = ma — 




dr m 




(53) 



^In [13], where Eq. (37) was originally derived, was taken to be the spin-curvature force (40); 
it is however easy to check, following the derivation therein, that it holds for an arbitrary force, as 
long as Eq. (30) holds. 
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-z 1 

ma = X S) , 



m r 



(54) 



cf. Eq. (53). Thus, the projection of the motion in the xqJ plane is circular, identical 
to and since S is constant to this accuracy, oscillates between positive and 
negative values along the orbit, leading to a bobbing motion, depicted in Fig. 6b. 
This bobbing can easily be understood as follows. The particle’s total momentum 
along z is constant (since there is no force along z, DP^ /dr = 0), and equal to 
zero, as one can see (since is the same) from the results above of the CP/OKS 
conditions. On the other hand, from Eq. (52), there is a hidden momentum along 
= — (5 X py jm, oscillating along the orbit (from P^^^ = —SF/m to 
^max — +SF/m); this means that the centroid bobs up and down in order for 
the kinetic momentum mv^ to cancel out keeping P^ = 0. Without loss of 
generality, we may take S = Sex, = v coscjr, and thus (v x Sy = —vS cos ujt; 
integrating z = Eq. (54), and noticing that mco^r = Qqlr^ (as the motion is 
circular in the xqJ plane), we obtain z = (vS/m) cos uot, describing oscillations of 
half amplitude vSIm. 

Nearly radial motion . — As we have seen above, for the CP or the OKS conditions, 
it follows from Eq. (46) that a particle with radial initial velocity will move radially, 
regardless of its spin. Take the case that the particle is dropped from rest at some 
point on the x axis; it will move in straight line along x towards the source, and we 
thus have that = 0, = P. Take S to be along z. For the FMP or the TD 

conditions the situation is different; there is a hidden momentum, given by Eq. (52), 



(increasing as the particle approaches the source, since F increases), which causes the 
centroid to deflect in the negative y direction, in order to keep py = py,^+muy = o. 
This is depicted in Fig. 6b. 



3.4.2 Gravitational System 

In the gravitational case, the situation is reversed in comparison to the electromagnetic 
system: now it is the “laboratory frame” (i.e. the observers at rest in the background) 
which is accelerated, therefore / 0 and P^ y when the centroid is 

computed in such frame, cf. Eq. (27); and it is when the centroid is computed in the 
comoving (FMP condition) or nearly comoving frame (TD condition) that we have 
P^ ~ mU^, since the only force present is the spin curvature force (40), which 
yields slO(S^) contribution to the hidden momentum, cf. Eq. (50). This is what we 
are now going to see in detail. 



^^Since = 0, is Fermi- Walker transported for the MP condition, Eq. (35), and approximately 

so for the TD condition (cf. Eq. (7.1 1) of [26]); since Eq. (54) is of first order in v, S can be taken 
constant therein. 




m m 
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As we have seen in Sect. 3.3, the force (40) depends explicitly on the spin condi- 
tion. For the FMP and the TD conditions, it can be written to lowest order as [24] 

F' = -2ejuv'‘G^’‘sj + ~ ma‘ , (55) 

where G = —m^r/r^ is the Newtonian (or gravitoelectric) field evaluated at 
ms is the mass of the Schwarzschild black hole, and in the second equality we used 
P = mil -h 0{S^) ^ md 2:^ F, as follows from (50). Explicitly: 



ma 



F = - 



3ms 



i; X 5 + 



2r[(v X r) 



S] ^ (v • r)S X r 



(56) 



Notice the first term, formally analogous to the first term of (53), which caused the 
bobbing in the electromagnetic system; but note as well that despite the similarity, 
they have very different origins: the latter comes from the inertial hidden momentum, 
whereas the former comes from the spin-curvature force. 

The coordinate acceleration is given by the sum of ma with the (radial) Newtonian 
“force” mG, 



m 



df^ 



= mG\z) + F‘. 



For the CP condition the situation is different, because this is now the case where 
the field (relative to which the centroid is computed) is not parallel 

transported along z^(r), Vxjwg|^ / 0; therefore there is hidden momentum (cf. 
Eqs.(18)and (27)): 

= = ^ Pmi-SxG. (57) 

The spin-curvature force takes also a different form with this condition, 

F' = -€jktv^G‘'‘sj + (58) 

(notice the relative factor of 2 comparing the first terms of (55) and (58)). The latter 
difference however is compensated by the difference between G(z) and G(z), as 
explained in Sect. 3.3: G^(z) — G^(z) — Ax j, with Ax = S xv/m,cf. Eq. (47); 
thatis,G^(z) — G^z)— ejuS^ /m, and thcrcf ore mG^z)-\-F^ = mG^z)-\-F\ 

The coordinate acceleration is thus given by 



m 



d^z‘ 

dr^ 



wG'(z) + F' - 



DPm 

dr 



m 



d^‘ 

df^ 



DPU 

dr 



(59) 



That is, the coordinate acceleration of the CP worldline z^{r) differs from that of 
the worldline if) of the TD/FMP conditions only by the hidden momentum term 
involved in the former. From (57) we have 
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^^hid 

dr 



^hid,i 



V = e 






_ms I 

-~^y 



-f I 5 X 5 + 3 



(v ■ r)S X r 



) 



(60) 



where we used the fact that, to this accuracy, DS/dr ^ 0. 

Nearly circular motion . — As in the electromagnetic case, we assume S e xqJ, 
and so, for a nearly circular orbit, (v x r) • S ^ 0, v • r ^ 0; therefore, the second 
term of (56) and the last term of (56) and (60) vanish. We have thus for the FMP and 
TD conditions 

m^= niG^z) - ^(v x Sf , 
dr^ r^ 

and for the CP condition 



m 



d^z d^z 



ms 



= ^(^ x 5') 



dr^ dr^ 



Both coordinate accelerations oscillate along the orbit, due to the terms v x S (since 
S is approximately constant), leading to a bobbing motion depicted in Fig. 6c, d. 
Hence, by contrast with the electromagnetic system, in this case a bobbing is present 
regardless of the spin condition (or the presence of hidden momentum); it is just 
larger for the CP condition, because the contribution for the bobbing from the hidden 
momentum adds to the bobbing caused by the spin-curvature force (they have the 
same form, only different factors). 

Nearly radial motion. — For a particle in radial motion in Schwarzschild space- 
time, the spin-curvature force under the FMP/TD conditions is exactly zero, = 
Vu+“ = 0, as shown in [24] (in the weak field and slow motion regime, one can 
check that from Eq. (56) above, by noting that the second term is zero, and the first 
and third terms cancel out when r || v). The hidden momentum is also exactly zero for 
the TD and the non-helical FMP solutions, so = mU^, and thus DU^/df = 0. 
When dropped from rest, the particle moves along a geodesic towards the source. 
Take the motion to be along the x axis, so that py = = 0, = P, and take S 

along z. 

For the worldline z^ (r) given by the CP condition, there will be a non- vanishing 
spin-curvature force, cf. Eq. (44); which, as shown above and in Sect. 3.3, just com- 
pensates for the difference in the Newtonian field G on the two worldlines, so that the 
coordinate acceleration differs only due to the hidden momentum terms, cf. Eqs. (45) 
and (59). Since the momentum P^ is the same regardless of the spin condition, the 
hidden momentum (57) that arises with this spin condition causes the centroid z^ to 
deflect in the y direction as it approaches the source, in order to keep 

py = mU^ FiSxdy = 0, 



just like the situation in the electromagnetic system for the FMP and the TD condition. 
This is depicted in Fig. 6c. Hence, the situation is opposite to the electromagnetic 
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analogue: for the FMP and TD conditions we have no hidden momentum, and 
has straight line radial motion; for the CP condition there is hidden momentum and 
a centroid that deflects from radial motion. 

Finally, if one takes solutions z!^(j') of the OKS condition, other than the one 
that (to this accuracy) coincides with the centroid z^ of the FMP and TD conditions, 
we have, from (45), d^z!^ = d^z^' Idf^, and thus z!^{r'^ are curves that run 

approximately parallel to the trajectories of the TD/FMP (non-helical) conditions. 



3.5 Hidden Momentum Arising from the “Torque” Tensor 

In this section we briefly discuss the hidden momentum (19) that is related to the 
torque tensor It is useful to split 

^ — ^DEM ^ ^QEM ^ ^QG ^ 

where [11] 

^DEM = t/T' ( 62 ) 

is the electromagnetic dipole torque, and Tqq are, respectively, the quadrupole 
electromagnetic and gravitational torques (the lowest order torque in the gravitational 
case), see [24] for the explicit expressions. All these torques (plus the higher order 
ones) will contribute to the momentum via Eqs. (17)-(19). A hidden momentum Phidr 
is originated whenever has a component along the vector field cf. Eq. (19), 
and it may be cast into two parts: a part which is pure gauge like the inertial hidden 
momentum (comes from the choice of the reference worldline z^ (r) ; may be 
made to vanish by suitable choices), and another part, which arises in some physical 
systems, that is not gauge. Let us discuss these two subtypes of hidden momentum 
separately. 



3.5.1 The Pure Gauge Hidden Momentum that Arises from 

This contribution is easier to understand if we think about a simple example. Con- 
sider a spinning particle in fiat spacetime as depicted in Fig. 1, with no forces 
{DP^/dr = 0), but now under a torque. Consider moreover to be spatial and 
orthogonal^ ^ to P^: Pp = 0. In this case, just like for a torque-free particle, the 

centroid v^j^(P) (“vcm” in Fig. 1), given by the condition Pp = 0, is at rest in 
the = 0 frame (note that P^ || for the reference worldline z^ = x^^(P), as 
follows from Eq. (16) with = P^/M). Since vg^^(P) is unaffected by the torque. 



^^E.g. the torque on an electric dipole in an uniform electromagnetic field, when is the common 
centroid given by the TD or the (non-helical) FMP condition. 
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it remains at rest at the body’s geometrical center, regardless of the fact that the spin of 
the particle is varying. Now consider another inertial observer (4-velocity u^) mov- 
ing with respect to the = 0 frame with constant velocity v (so that Du^ /dr = 0 , 
ensuring that no inertial hidden momentum comes into play); not only the centroid 
it measures is shifted to the right relative to as depicted in Fig. 1, 

as the shift ( 10 )-( 12 ) also varies, since varies due to the torque: 



dr 



1 DS'i^ 
m(u) dr 



-U(5 = 



m(u) 






(63) 



(i.e. the body’s rotation velocity varies, causing to vary). This means that the 
centroid = xgj^(i/) will be moving in the = 0 frame; i.e. its 4-velocity 
fj^ = dz^ I df will have a component orthogonal io which reads (in this special 
case that DP^/dr = 0, so that Eq. (24) holds) Uf = D/^x^ldf. 

In the general case when there are forces acting on the particle, however, as 
already mentioned in Sect. 3.2, one should not think of IJf as the velocity of the 
centroid z^ relative to z^ = xgj^(F), because the latter is not at rest in the = 0 
frame. The general argument should be given instead as: the position of the centroid 
-a _ x^yfu) as measured by a given observer depends on the body’s angular 
momentum', when the latter varies due to the action of a torque, Xq^(u) moves 
accordingly; P^, however, is unaffected, leading to P^ ]){ The general (with 
Du^ /dr = 0) expression for Uf_ formalizing this statement follows from Eqs. (16) 
and ( 22 ) 12 ; 



Ul = f-(/*^)“ . (64) 

m{u) 

Finally, if Uf_ 7 ^ 0, then 7 ^ 0 — i.e. when the centroid moves in the = 0 
frame, the momentum P^ is not zero in the centroid frame (the = 0 frame); thus 
there is hidden momentum, the two effects being reciprocal (and mere consequences 
of the fact that P^ U^), cf. Eq. (25). 



3.5.2 “Dynamical” Hidden Momentum 

In general, the momentum of a multipole particle subject to electromagnetic and 
gravitational fields is not parallel to its 4- velocity regardless of the spin condition; 
that happens when is not a spatial tensor (i.e. when r^^up 7 ^ 0 for all unit 
timelike vectors u^), and is related to a type of hidden momentum which occurs in 
some physical systems and is not gauge. Following [17], we dub this part of 
the “dynamical” hidden momentum. To dipole order, it arises in magnetic dipoles; 



^^To obtain (63) from (64) in the special case above, one uses dr = 7(1/, P)dr to write Uf_ = 
y(U, P)DAx^ /dr, computes /dr from (43) using (15) to obtain = 7 ( 1 /, P)r^^ , and 
finally uses the assumption above Pp = 0 ^ 
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let us then consider the case when in Eqs. (61) and (62). Take the 

magnetic dipole moment to be proportional to the spin, = cjS^^\ we have from 
(19), for an arbitrary spin condition = 0, 

^ ^“dEM . 

where (E^)^ = . If (£’“)^ ]){ /i^, never be zero, because then 

^ 0 and is a space-like vector, thus cannot be parallel to any LT^. For the 
FMP condition (u^ = U^), P^idEM t^kes the suggestive form 

^WdEM = . (65) 

where E^ = is the electric field as measured in the centroid frame. In such 

frame, and in vector notation, ^idEM = Jl ^ E, which the most usual form in 
the literature for the hidden momentum that a magnetic dipole acquires under an 
external electrom^netic field (e.g. [19-22]). It equals minus the electromagnetic 
field momentum generated by a magnetic dipole when placed in the external 
electromagnetic field, which, in the particle’s frame, reads (see e.g. [19, 22, 24]) 
= f E X ^dipole = — yS X It should howcvcr be noted that PhidEM is not 
field momentum; it is purely mechanical in nature, which can be understood through 
simple models, see e.g. [19, 22] (in particular Fig. 9 of [22]). Such momentum plays 
an important role in the conservation laws. Consider, for example, a magnetic dipole 
at rest in an external electric field; since no force is exerted on the particle, the setup 
is stationary, it follows from the conservation equations (7iot)°^(^^ = 0 that the total 

spatial momentum Ptot = ^matter + ftM (i-e. the matter momentum plus the field 
momentum) must vanish. The momentum of the electromagnetic field, PpM = , 

however, is not zero; it is the momentum ^matter = ^hidEM = — ftM, hidden in the 
dipole, that cancels out Pem, ensuring Ptot = 0, as required by the conservation laws. 

^WdEM leads to exotic motions, quite analogous to the ones coming from the 
inertial hidden momentum studied in Sect. 3.4, as one would expect from the formal 
analogy between (65) and the inertial hidden momentum under this spin condition, 
Phidl = cf. Eq.(30). Indeed, if in the application in Fig. 6a, b we 

considered particles with dipole moment = a ^ 0, there would be a bobbing 
(in addition to the one caused by for a particle orbiting the source, and, in the 
case of a particle in initially radial motion, there would be a sideways dipole force on 
it, but due to / e-^em particle’s sideways acceleration would actually be opposite 
to the force. This effect is discussed in detail in [24]. However, a crucial difference 
exists between these effects and the effects discussed in the previous sections: the 
hidden momentum in Eq. (65) is not gauge, nor the motions generated by it are (in 
general) made to vanish by any choice of center of mass. 



250 



L.F.O. Costa and J. Natario 



4 Conclusion 



In this paper we have discussed and compared in detail the different spin supplemen- 
tary conditions in the literature, with special attention being given to the lesser-studied 
(but potentially useful) Corinaldesi-Papapetrou (CP) and Ohashi-Kyrian-Semerak 
(OKS) spin conditions. One of the main points is that the different solutions allowed 
by the different spin conditions are equivalent descriptions of the motion of a given 
body. We have shown this equivalence to pole-dipole order, explaining the change of 
the spin-curvature force under the different conditions — which is seen to be precisely 
what ensures the consistency of the different solutions, as it has the magnitude needed 
to prevent the worldlines from deviating due to tidal effects of a curved spacetime. 
This builds up on the work in [16] (dealing with free particles in flat spacetime) and 
backs the claims in [13] about the equivalence of all spin conditions in a curved 
spacetime. 

We clarified the origin of the non-parallelism between and which can 
be cast as the particle possessing a “hidden momentum”, a concept introduced in 
General Relativity in [17], and further developed herein. It consists of two main 
parts: an “inertial” part that arises solely from the spin condition (i.e. from the 
choice of the observers relative to which the center of mass is measured), which 
we therefore cast as gauge, and another term arising from the torque tensor 
Ta( 3 , which generically sub-divides into a part that again is gauge (arising from the 
motion of the centroid measured by some observers that is induced when varies 
due to Tq,/ 3 ), and a “dynamical” part, which is not gauge. The latter, to dipole order, 
consists of a form of hidden momentum that arises in electromagnetic systems, and 
was previously known from treatments in classical electrodynamics. 

The differences between the various spin conditions were discussed and illustrated 
with suitable examples; in particular the reciprocity (first noted in [17]) that exists 
when one compares spinning particles under an electromagnetic field in fiat space- 
time to spinning particles in a gravitational field: in the first case, when one picks 
the centroid as measured in the Laboratory frame (corresponding to the CP/OKS 
conditions), there is no inertial hidden momentum, Pj^^j = 0, and thus (if = 0) 
the momentum velocity relation is simply = mU^\ and when one computes the 
center of mass in the comoving frame (FMP/TD conditions), P*^ is no longer parallel 
to leading to exotic motions (like bobbings). In the gravitational case the situa- 
tion is reversed: when one chooses the TD or the (non-helical) FMP conditions, P^ 
is approximately parallel to ; and it is when one chooses the Laboratory centroid 
(CP condition) that hidden momentum arises. 

All the spin conditions studied present interesting features. The CP condition 
yields a natural description, as it amounts to compute the centroid in the same frame 
where the motion is observed (the “Laboratory” frame, which is given in advance); it 
leads however to considerable superfluous motions in gravitational systems. The TD 
condition defines always an unique center of mass, which is the central worldline of 
the worldtube of centroids (can thus be thought of as describing the “bulk” motion 
of such worldtube). The FMP condition yields the most natural transport law for the 
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spin vector, and also gives rise to exact gravito-electromagnetic analogies (see [24]); 
however it is not always easy to single out the non-helical solution from the (infinite) 
helical solutions allowed by this condition (the latter should be avoided, as they are but 
unnecessarily complicated descriptions of the motion, as discussed in Sect. 3.2.3), and 
no general prescription for that is known. As for the OKS condition, it always gauges 
away the inertial hidden momentum and its induced motions, ensuring the simplest 
equations for the centroid motion; in the absence of torques, one has = ma^, i.e. 
these are Newtonian-like (or “dynamical”) centroids, which accelerate only if there 
is a force. 

It is however crucial to notice that in spite of the equivalence of the descriptions, 
and the fact that the trajectories of the different spin conditions are contained within 
the (convex hull of the) body’s worldtube, their differences, and the superfluous 
motions induced by some of them are not negligible (even in weak field, slow motion 
approximations), and should not be overlooked. As it is also important to distinguish 
these motions from the physical effects. For, as we have exemplified in Sect. 3.4, the 
pure gauge contribution to the centroid acceleration with the CP condition is of the 
same order of magnitude as the one from the spin-curvature force itself; and it can 
actually be much larger, as is the case of the acceleration of the outer helical solutions 
of the FMP condition, which can be made arbitrarily large. 
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Momentum and Angular Momentum in Curved Spacetime 

In rectangular coordinates in fiat spacetime, the momenta and of an extended 

body, as measured by some observer of 4- velocity , are well defined by the integrals 



where E(z, w) is the hyperplane orthogonal to (the rest space of u^), and 
is the vector connecting the reference worldline to the point 
of integration x^. In curved spacetime the situation is different, as these integrals 
amount to summing tensors defined at different points; different generalizations of 
the fiat spacetime notions have been proposed in the literature (e.g. [11, 25, 26]), 
none of them seeming a priori more natural than the others. Herein we discuss the 
mathematical meaning of the definitions used in this work, and how they relate to 
the schemes by Dixon [11, 26]. 

All schemes agree on generalizing E (z, w) by the geodesic hypersurface orthogo- 
nal to , and on replacing by the vector X g 7^ tangent to the geodesic connecting 
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and and whose length equals that of the geodesic. That is, X = 0(x), where 
O = exp“^ is the inverse exponential map, mapping points in the spacetime mani- 
fold to vectors in the tangent space ^ \ M Where the schemes differ is in 
the way the vector is integrated. We adhere to the scheme proposed 

in [25]: using the natural map for tensors induced by exp^ to pull back the energy- 
momentum tensor and the volume element to 7^, and integrate therein, which is then 
a well defined tensor operation. Let denote an orthonormal co-frame on the 
moments can then be written in the manifestly covariant form 

= f T(0*S2^, JX) ; 

= 2 [ dT) 

Jy.{z,u) 

Notethat since T( JX) = (exp* exp* JX), one is indeed pulling back 

the integrands from Af to 7^. Note also that Eqs. (66) and (67) are equivalent to (1) 
and (2), i.e. they just amount to perform the integration in a system of Riemann normal 
coordinates {x^} centered at (the coordinates naturally adapted to the exponential 
map). This is because such system is constructed from geodesics radiating out of 
thus the components of X, in global Lorentz coordinates in 7^, are equal to the 
coordinates on M \ also the basis 1 -forms of such system are the pullbacks of 
to Ai, dx^ = 0*^2^; and, taking it comoving with (i.e., at z, = u), E(z, u) 

coincides with the spatial hypersurface = 0. 

Let us now compare these definitions with other schemes in the literature. In [1 1], 
and 5^^ are defined as 

Pdi. = [ 9aT^^di:p- = -2 [ (68) 

where cr^(x, z) = — (0(v))^ = —X^, cf. [44]. These definitions thus differ from 
(66) and (67) only in the way the vector = T^^dTi^ is integrated: is a 

bitensor which parallel transports A^ at to z^ along the geodesic connecting the 
two points, so that the integral is performed over vectors A^\z = 9a \x defined at 
z^ (in [26, 44] different propagators, in the notation therein, are employed; 

the two schemes are not equivalent though, as noted in [26]). Writing g^A^\x = 
A^ \x + XA^, with XA^ = — {x')A^dx'\ expanding the integrand in Taylor 

series around z^, and noting that, in the normal coordinates {x^} (see e.g. [27]), we 
have r|.(z) = 0 and lir|. j(z)|| ~ ||R||, where ||R|| ^ denotes 

the magnitude of the curvature, we have A.4“ = 0(||.4|| ||R||x^). Therefore = 
P" + C>(A||P||), where 



( 66 ) 

(67) 



A = ||R||a2, 



(69) 
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and a is the largest dimension of the body. Thus, when A ^ 1 , i.e. when the curvature 
is not too strong compared to the scale of the size of the body,^^ 21 P^. The 
two schemes are actually indistinguishable in a pole-dipole approximation, where 
only terms to linear order in x are kept in the integrals defining the moments; the 
resulting equations of motion are the same (compare Eqs. (43), (49) of [25] with 
Eqs. (6.31) and (6.32) of [11], or Eqs. (7.1) and (7.2) of [26]), both schemes leading 
to the well known Mathisson-Papapetrou equations (the latter derived using less 
sophisticated formalisms). These conclusions are natural, for the metric in Riemann 
normal coordinates is (e.g. [27]) of the form = 7 /^^ -h 0(||R||v^); hence the 

assumption A ^ 1 amounts to say that, /6>r the computation oi and 5^^, one 
may, to a good approximation, take the spacetime as nearly fiat throughout the body. 



The Dependence of the Particle Momenta on E 

The momenta (1) and (2) depend, in general, on the spacelike hypersurface E (z, w) = 
E (z(r) , w) on which the integration is performed, see e.g. [1 1 , 25, 26] . This is so even 
in fiat spacetime; when forces and torques act on the body, it is clear that P^{z,u), 
u) depend on z^{r), and also on the argument of E. Curvature brings 
additional complications, as is no longer a “free vector”, and E itself is in prin- 
ciple point dependent. Herein we shall show that, in the absence of electromagnetic 
field (F^^ = 0), and under the assumption A ^ 1 made above, for hypersurfaces 
E(z, u) through a point z^ within the body's convex hull, the u^ dependence of the 
momentum and angular momentum is negligible. 

Denote by ^ = dx^ a particular basis 1-form of the Riemann normal coordinate 
system {x^}\ P^ = is thus the ^ component of P". Erom definition (1), and 
since has constant components, we may write the ^ component of the momentum 
as the integral of a vector on a 3-surface, 

pHz, m ) = = f T^kad^ = f . 

Take u^ = P^/M, and consider another vector u'^ at the same point z^; the ^ 
component of the difference between the momenta computed in the hypersurfaces 
E(z, u') and E(z, u), AP^ = P^(z, u') — P^(z, u) is, from an application of the 
Gauss theorem (see Eig. 7), 

- I = I A^.gdV - I A^.gdV . 

JTsiz,u) JVtef, JvRigk, 



^^For example, in the case of the Schwarzschild spacetime, ||R|| ~ ms/r^, A = (ms I r)(a^ j 
since ms/r < 1 for any point outside the horizon, A 1 is guaranteed just by taking the size of 
the body much smaller than its the distance to the source, a^. 
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Fig. 7 Shadowed regions V^eft and VRight are the 4-volumes delimited by the hypersurfaces 
S(z, u'), S(z, u), and the boundary of the body’s worldtube, of convex hull W . is chosen 
parallel to a Curved spacetime; b flat spacetime 



Here Vheft and V Right denote the shadowed regions of Fig. 7 (where / 0), i.e. 
the “left” and “right” 4- volumes delimited by E(z, u'), E(z, u) and the boundary of 
the body’s worldtube. Now, using the conservation law = 0, one notes that 

^1/3 = = r“^Ca;/3; 

thus 

= / T'^^UodV - [ 

d ^Left d VRight 

Since ^ is a basis 1-form, ^ = 0, and 



therefore 






|AP«| < m J^T^\x\dV = mv(T^\x\), 



where V = Vie ft + y Rights { • ) denotes the average on the shadowed region of 
Fig. 7a, and we noted that is the largest component of and always positive. 
Since Z? <av(u\u) (seeFig.7), with w) definedby Eq. (9), and w) < 1, 
then (|x|) < a; moreover (assuming = uatz), V < Mav{u', u); hence we 
get 

lAP^I < MXv(u\ u) = ||P||Ai;(m', u) , (70) 



showing that is negligible^'^ compared to under the restriction above on 
the strength of the gravitational field, A ^ 1 (the same under which the different 



^^The inequality (69) means not only that the components of AP in the system {x^} (where P* = 0) 
are much smaller than M, but also much smaller than the typical spatial momentum in other frames. 
For instance, in normal coordinates } comoving with one has |P* | ^ ^(u' , u)v(u' , u)M 
andlAP^'l < |AP^|7(w', w), thus | AP^'l « |P^'| whenA« 1. 
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multipole schemes become equivalent, and one can take local Lorentz coordinates 
as nearly rectangular throughout the extension of the body; see also footnote 13). In 
the application in Sect. 3.4 — Schwarzschild spacetime, far field limit — we can write 

lAP^I < — ^a^v{u , u) ~ llPhidlll^ v{u , u) 

^Moller r 

where is the inertial hidden momentum of the CP condition, Eq. (57) (T^idi 
zero or negligible for the other solutions). Thus is negligible compared to 
under the condition ^ jv(u\u), which is reasonable in a problem where 

the particle’s spin is worth taking into account (e.g. in the problem of nearly circular 
motion in Sect. 3.4 this amounts to taking o;body ^ ^orbit. where embody and c^orbit are 
the body’s rotation and orbital angular velocities). 

Through an analogous procedure, one can show that the dependence of 
on is negligible in this regime. Let so that = 

fi:(z u) and consider the two basis spatial 1 -forms ^ and rj. Construct- 
ing the vector iah (5^ we can write the ^ <S> rj component of 5''^^ as 

S^^(z, u) = the Gauss theorem, 

= [ /%dV - [ J%dV ~ ||R|| [ x^\J\dV = ||R||y(|/|x2) 

JVLef, JVRigH, ” JV 

< ||R||fl2y(|7|) = Ay(|7|) < \a\{u', m)(|7|) , 



where In the second relation again we used t 7- = 0(||R||v). Since 

a(3 

S = 0(a^{\J\)), cf. Eq. (13), we see that indeed when A ^ 1, || ^ S. 



The Case with Electromagnetic Field 

When 7 ^ 0, the conservation law is (denoting by the 

particle's energy momentum tensor). Consider for simplicity fiat spacetime, and let 
^ be a basis 1-formof ag/< 9 Z?( 2 / Lorentz system; then = 

Note that = F^^j^ is the Lorentz force density. It follows (see Lig. 7b) 

= L ^ - L ^ - ^Righ, [nRigh) • 

J VLeft J y Right 

We have VLeft = Vmght = ^ Right^ Right Z'2- (where V^'^'> denote 

3 -volumes orthogonal iou^). Herein we allow to be any point within the worldtube 

of centroids; it follows that 
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^Left — v{u, U ) ^MollerJ 5 Right ^ ■> U ) ^MollerJ 

~ G ■ ~ (i + • 

Let inuft = Right + A/“, with II A/“|| < II V^nia; we obtain 

|AP«| < ||F^||7?Mollert^(M', U) + ||V;F^||i;(m', u)a^ . (71) 

Hence has, as upper bound, the sum of two terms: the impulse of the Lorentz 
force in the time interval ^Mollert^(i^^ u) (as measured in the =0 frame) 
between the two points where the hyperplane E(z, u') crosses the worldtube of 
centroids, plus a term analogous to the gravitational one (70). For the field of a 
Coulomb charge, discussed in Sect. 3.4.1, they read 



II^L II^MollerW(M', m) = \E^\v{u , m) ~ || PhidI II m) 

r 

2 

||V,F^||i;(M',M)a2 = |£ |t;(„»^ ~ ||Phidl||-^-n(M',M) 

^Moller r 

where ^ 2 / r is the electric potential energy, and is the inertial hidden 

momentum of the TD/FMP (non-helical) solutions, Eq. (52) (for the CP/OKS con- 
ditions, = 0). Assuming |£'p| < M, if 7?Moller/^ ^ 1 and 1 (as is 

the case in the far-field regime), then |AP^| ^ M =||P||, and AP^ is negligible 
compared to by arguments analogous the ones given in footnote 14. It is also 
negligible compared to ||Phidill under the following conditions: (i) v(u',u) 1 so 

that the first term of (71) can be neglected (this is guaranteed by the slow motion 
assumption in Sect. 3.4); (ii) that ^ jv(u',u), Si condition analogous to the 

one we obtained gravitational case above, which is reasonable whenever the particle’s 
spin is worth taking into account. 

Note that the argument above can equally be used to show that P^ does not 
depend on the spin condition. Start with the TD centroid: = Xq^(u), with = 

P^ /M\ the centroids x^y^iu') of other spin conditions are reached by x^y^iu') = 
x^y^(u) -h Ax*^, with Ax^ G E(w, z), cf. Eq. (10). Since the argument above applies 
to any spacelike hyperplane E (u' , z') through any arbitrary centroid z'^ on E (w , z), it 
effectively means that, to the accuracy at hand, P^ does not depend on the particular 
centroid chosen. 
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General Relativistic Two-Body Problem 
Theory and Experiment and the Role 
of Hidden Momentum 



R.F, O’Connell 



Abstract Emphasis will be placed on unusual aspects of our derivation, particularly 
the use of techniques from the more developed realm of QED (quantum electrody- 
namics). We obtain explicit results for the precession of the spin (rotation) and orbital 
effects and we discuss the relatively recent (2008) verification of our spin precession 
result. Unusual features of our orbital precession analysis is the appearance of hidden 
momentum effects whose origins may be traced to a special relativistic effect. 



1 Introduction 



The work of Schiff [1] in 1960 led to a resurgence of interest in analyzing and mea- 
suring the Lense-Thirring effect due to spin (rotation) effects. In particular, Barker 
and I [2], using a potential which we derived using techniques from QED, obtained 
the classical motion of a gyroscope in the gravitational field of a much larger mass 
with a quadrupole moment. Our method of derivation was much shorter and more 
straightforward than conventional methods especially as we made use of familiar 
Lagrangian concepts. Also, our results agreed with Schiff except that our equations 
of orbital motion appeared to be different than those of Schiff (although they both 
led to the same orbital precession results). Later, we traced this difference to the use 
of a different choice of coordinates [3], which was not due to the freedom enjoyed 
by the use of general relativity but due to a special relativistic effect, which we will 
discuss below in connection with hidden momentum. 

When the first binary pulsar was discovered [4], we were motivated to extend our 
results to the two-body domain [5, 6]. However, due to the nature of their system, 
Hulse and Taylor were unable to observationally verify our results. However, with the 
discovery of the double pulsar, our results were verified to an accuracy of 13 % [7]. 
In Sect. 2, we discuss the derivation of our results and their experimental verification. 
Then, in Sect. 3, we discuss the origin of hidden momentum. 
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2 Two-Body Results: Theory and Experimental Verification 

QED theory generally deals with the electromagnetic interaction of elementary parti- 
cles whereas gravitation for the most part is confined to macroscopic systems. Thus, 
we decided to start with the gravitational interaction of two electrons, given by a 
one-graviton exchange interaction [8]. Next, making use of the universality of the 
gravitational interaction, we obtain the classical macroscopic result by letting 

2 

^ 5 ( 2 )^ ( 1 ) 

2 

where and are the Pauli spin matrices associated with electrons 1 and 2 and 
where and are the classical spin angular momenta of the macroscopic masses 

m\ and m 2 . Thus, we obtain the gravitational potential energy Vi + V 2 , correct to 
order c~^ (first post-Newtonian order). This leads to the spin-dependent Lagrangian 
and Hamiltonian. In order to obtain the spin-independent Hamiltonian Hq in the 
center-of-mass system, we start with an expression give by Hiida and Okamura [9] 
which contains a quantity a which is an arbitrary dimensionless parameter which 
can take a number of different values, one of which (a = 0) corresponds to the EIH 
Hamiltonian. However, we found it convenient to select a value in order to obtain 
a Hamiltonian without a term. Eurther work enabled us to include quandrupole 
moment terms for both masses so that we finally obtained, in an obvious notation, 
the total Hamiltonian 



Htia) = Mc^ + H(a) + + Vs 2 + Vsi,s2 + Vqi + Vq 2 , (2) 

from which the total Lagrangian readily followed. The Euler-Lagrange equations 
were then used to obtain the precessions of the spins and the precession of the orbit. 
Some general comments on our results are as follows: 

(1) If we make the replacement Gm^ in the result for the electromagnetic 

interaction for positronium [10, p. 286], we obtain terms of the same structure as our 
gravitational results. The numerical coefficients are generally different excepting the 
case for the spin-spin interaction term Vsi,s2 where they are the same. 

(2) In the large mass approximation (m 2 ^ mi), the results reduce to the Lense- 
Thirring results. 

(3) Our procedure for calculating the precession of the orbit was greatly facilitated 
by making use of the fact that the Runge-Lenz vector A is a constant in the non- 
relativistic case. In addition, A and L are always perpendicular, where L is the orbital 
angular momentum and thus although both A and L process in the more general case 
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they must do so in such a way as to ensure that A • L = 0. In other words, the secular 
results for the precession of the orbit are of the form 

= ^* X L, 

Aav = ^* X A. (3) 

The fact that appears in both equations provided a beautiful check on the results. 

(4) Defining the total angular momentum as 



7 = L + (4) 

we found that 

7«, = 5* X L + X 5(1) + X 5(2) = 0, (5) 

where ^2* is the precession of L and where Qav is the precession of the spin of 
body 2, which can be obtained from the result for by interchanging the indices 
1 and 2. In other words, the total angular momentum is conserved, which provides 
another excellent check on our results. 

(5) The orbital precession Q* was broken down into time derivatives of the longitude 
of the ascending node, the argument of the perihelion and the inclination of the orbit, 
following astronomical and space physics practise [5]. This is clearly displayed in 
Figs. 2 and 3 of [11]. In addition, we note that the first verification of the one-body 
Lense-Thirring results by Ciufolini and Pavlis [12] made use of nodal precession. 

(6) One notable result which emerges is that, in the case of arbitrary masses mi and 
m 2 , the dominant spin-orbit contribution to the spin precession of body 1 is a factor 
(m 2 + /x/3)/(mi -h m 2 ) times what it would be for a test body moving in the field of 
a fixed central mass (m 1 -h m 2 ) . Here /x denotes the reduced mass m 1 m 2 /(m 1 -h m 2 ) . 
This contrasts with the result of Robertson for the periastron procession where the 
corresponding factor is unity. This result, which was published in 1975 [5, 6] had 
to await until 2008 (a period of 33 years) for its verification by the work of Breton 
et al. [7] utilizing the double pulsar PSR J0737-3039A/B [13]. This pulsar consists 
of two neutron stars in a highly relativistic 2.45 h orbit, which displays an eclipse 
of pulsar A of the order of 30 s. when pulsar A passes behind pulsar B, enabling a 
measurement of the relativistic precession of the spin axis of pulsar B around the 
total orbital angular momentum L. 

The double pulsar system, is the only system observed so far in which both com- 
ponents are pulsing neutron stars and has the added advantage that the system is 
observed nearly perfectly edge-on. Its periastron precession of 16.9°/yr has been 
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measured with enormous accuracy (to 6 significant figures) which enabled the total 
mass M = m\-\- m 2 = 2.2587 Mq to be measured also very accurately. However, in 
order to obtain the spin-orbit precession it is necessary to determine both m \ and m 2 . 
These were determined from the projected semi-major values and the Shapiro time 
delay with the results m{A) = m 2 = 1.3381 Mq and m(B) = m\ = 1.2489 Mq so 
that the measured precession rate of pulsar B is 4.77 zb 0.66°/yr, in agreement, to an 
accuracy of 13 % [7], with the theoretical rate [5] of 5.0734 zb 0.0007°/yr. Further 
details on our theoretical work and recent experiments appear in [11, 14]. 

(7) There is also a precession of L in the double pulsar system due to spin precession 
effects, as is clear from Eq. (5). The dominant effect is due to pulsar A [15] since its 
spin frequency is 122 times larger than that of pulsar B. However, the effect is only 
about 4.06'7yr and thus may be difficult to observe. 



3 Hidden Momentum 



In the introduction, we stated that, even in the 1-body case, our results for the equa- 
tions of orbital motion differed from these of Schiff but that they both led to the same 
orbital precession results. As we pointed out [3], the resolution of this problem may 
be traced to the work of Moller [16] and [17, p. 176]. He pointed out that in special 
relativity, a particle with structure and “spin” (its angular momentum vector in the 
rest system) is subject to a spin supplementary condition, which “ — expresses in a 
covariant way that the proper center of mass is the center of mass in its own rest 
system {K ^) — ” and that “ — the difference between simultaneous positions of the 
centre of mass in K (obtained from by a Lorentz transformation with velocity 
(S) and the proper centre of mass (in K ^) — ” is 



Ar = 



S X V 
mc^ 



( 6 ) 



where (S) is the spin and m is the rest mass. In essence, it is related to the fact that, 
in special relativity, there are two rest systems for the particle, zero velocity and zero 
momentum, reflecting the choice of spin supplementary conditions and the fact that 
only in special cases are the velocity and momentum proportional to each other [18] 
and [19, see Eq. (7) which for ms the basis of a more rigorous quantum mechanical 
calculation] . We refer to an extensive review for more details [20] . 

It is clear that an increase in the coordinate by an amount of Ar will give rise to an 
increase in the momentum, which is generally referred to as hidden momentum AP. 
An expression for AP was obtained in two extensive papers [21, 22] but a simple 
derivation is possible as we shall now demonstrate. Thus, taking the time derivative 
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of Eq. (6), and neglecting the very small second order terms in S which arise in the 
relation between the velocity v and momentum P, we obtain 



AP 



S X a 



(7) 



where a is the acceleration. This is a general expression for the hidden momentum. 

In particular, in the realm of electrodynamics, since the expression for a magnetic 
dipole moment M is derived from either the spin of a particle or from a steady current 
(bodies with structure in both cases), it is clear that 



AP =ki 



M X a 



( 8 ) 



where k\ is a constant. 

In the particular case where the particle with a magnetic moment M is interacting 
with a pointlike electric charge e then, the electric field E created gives rise to an 
acceleration that is given eEjm so that 



AP=k2 



M X E 



(9) 



where k 2 is a constant. This is essentially the expression used in all discussions in 
electromagnetism [23-29] for the hidden momentum. Thus, the hidden momentum 
of electrodynamics is a particular consequence of the motion of a spinning body (or, 
alternatively, a magnetic moment) for which, due to special relativistic effects, the 
simultaneous position of the center of mass in the moving system is different [by an 
amount given in Eq. (6)] than the proper center of mass. 
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Equations of Motion of Schwarzschild, 
Reissner-Nordstrom and Kerr Particles 



Peter A. Hogan 



Abstract A technique for extracting from the appropriate field equations the rel- 
ativistic motion of Schwarzschild, Reissner-Nordstrom and Kerr particles moving 
in external fields is motivated and illustrated. The key assumptions are that (a) the 
particles are isolated and (b) near the particles the wave fronts of the radiation gen- 
erated by their motion are smoothly deformed spheres. No divergent integrals arise 
in this approach. The particles are not test particles. The formalism is used, however, 
to derive the Mathisson-Papapetrou equations of motion of spinning test particles, 
neglecting spin-spin terms. 



1 Introduction 

We describe a method for modeling the relativistic motion of (uncharged or charged) 
particles moving in external gravitational and electromagnetic fields in general rel- 
ativity. The particles modify the fields in which they move and thus are not test 
particles. The modified fields near the particles are predominantly the Schwarschild, 
Reissner-Nordstrom or Kerr fields and thus the particles will be referred to as 
Schwarschild, Reissner-Nordstrom or Kerr particles respectively. The origin of the 
approach described here is the seminal and challenging paper by Robinson and 
Robinson [1], with the disconcertingly modest title of “Equations of Motion in the 
Linear Approximation”, published in the festschrift in honor of Professor J.L. Synge. 
This paper inspired early work involving space-times which are more special than 
those required here and which can be found in a series of papers by Hogan and 
Imaeda [2-4] (see also [5]). A line element, which is a byproduct of a study of grav- 
itational radiation from bounded sources [6], plays a key role in the development of 
our method. An early application can be found in [7]. The most up-to-date references 
to our technique and applications are in the paper [8] and the books [9] and [10]. 
Further development of this work has also involved graduate students Takashi Fuku- 
moto and Shinpei Ogawa in Tohoku University, Aishling Nic an Tuile in University 
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College Dublin and Florian Bolgar in Ecole Normale Superieure, Paris and has taken 
the form of M.Sc. and Ph.D. theses and an Internship Report. 

To introduce our approach we begin with the Eddington-Finkelstein form of the 
Schwarzschild line element: 

ds^ = —r^pQ^(dx^ + dy^) 2du dr ^ ^ du^, (1) 

with 

po = l + ^{x^ + y^). ( 2 ) 

Here X, y are stereographic coordinates taking values in the ranges — oo < x < +oo 
and —00 < < +oo,i/isa null coordinate (in the sense that the hypersurfaces u = 

constant are null) with — oo < w < +oo. The generators of these null hypersurfaces 
are null geodesics labelled by (x, y) and r (with 0 < r < +oo) is an affine parameter 
along them. When the mass parameter m = 0 the space-time is Minkowskian with 
line element 



dsQ = —r^PQ ^(dx^ -h dy^) -\-2dudr -\- du^ = pijdX^ dX\ (3) 

with X^ = (X, Y, Z,T) rectangular Cartesian coordinates and time and pij = 
diag(— 1, —1, —1, +1). In this background space-time with line element (3) the 
hypersurfaces u = constant are future null cones with vertices on the time-like 
geodesic r = 0. We can consider 



2m 2 

yij dx dx-' = du 



( 4 ) 



in coordinates x^ = (x , y , r, w) , as a perturbation of this background which is singular 
on the time-like geodesic r = 0 and which, when added to the line element (3) 
produces the line element (1). Although we have no need to assume it now, we shall, 
in the sequel, assume that m is small of first order and write m = 0\ 'm order to 
simplify the complicated calculations that develop. We shall consider the background 
space-time a model of the external field in which the small mass is located. In the 
example here the background space-time is Minkowskian space-time and thus there 
is no external field present. Also in this example the time-like geodesic equations 
(satisfied by r = 0 in (3)) are the equations of motion of the small mass. 

In the space-time with line element (3) let X^ = (u) be an arbitrary time-like 

world line with u proper time or arc length along it. Then {u) = dvY jdu is the 
unit tangent to this line and satisfies vjv^ = rjijv^v^ = +1 along the world line. 
Equivalently is the 4- velocity of the particle with world line X^ = w\u). Thus 
= dv^ I du is the 4- acceleration and satisfies ajv^ = pija^v-^ = 0 along the 
world line. We shall lower and raise indices, referring to the coordinates X ^ , using 
rjij and rf^ respectively, with the latter defined by rf^rjjk = 
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are the components of the 4- velocity and 4-acceleration of the particle expressed 
in the coordinates . These features of the world line can be incorporated in the 
Minkowskian line element by replacing (3) by 

dsQ = —r^PQ^{dx^ + dy^) -\- 2 du dr — 2hor)du^, (5) 



with 

Pq = xv^ (u)-\-yv^(u)-\- 
and 






v"^(u) + 



l + -(x2+j2) 



ho = T-OogPo) = ajk\ 
au 



/(m), ( 6 ) 



(7) 



with 



ki = Pq-* 



—xd{ — y^{ — 



l--(x2 + /) 



5I + 



l + -(x^ + y^) 



Si 



( 8 ) 



In (5) r = 0 is the world line (u) with the 4- velocity components appearing 

in (5) via the function Pq and the 4-acceleration appearing via Hq. The future null 
cones with vertices on this world line are the null hypersurfaces u = constant. The 
generators of these null cones are the null geodesic integral curves of the vector field 
y , which satisfies = 0 and kj = + 1 . On each future null cone the generators 
are labeled by the stereographic coordinates v , y and r > 0 is an affine parameter 
along them. To see how Einstein’s vacuum field equations can lead to the equations 
of motion as we have defined them in the paragraph above we consider a space-time 
with line element 



ds^ = —r^P^ ^(dx^ -h dy^) 2 du dr ^ — 2hor ^ du^, (9) 

which is a generalization of the Schwarzschild space-time with line element (1). The 
Ricci tensor components in coordinates x^ = (v, y, r, w) are now given exactly via 

Rijdx‘dxj = du^. ( 10 ) 

With m 7 ^ 0 we see that the vacuum field equations Rij = 0 require ajk^ = /zq = 0 
and this should hold for all possible y . Hence we must have, as equations of motion, 
the time-like geodesic equations 

( 3 ^= 0 , ( 11 ) 



for the world line r = 0 in the Minkowskian background space-time. This sim- 
ple example illustrates a connection between the equations of motion and the field 



268 



RA. Hogan 



equations. In general we shall find, however, that the field equations by themselves 
will not be sufficient to determine the equations of motion. They will have to be sup- 
plemented by a smoothness condition on the wave fronts of the possible radiation 
produced by the motion of the particle. A singular point on a wave front translates in 
space-time into a singularity along a generator, extending to future null infinity, of 
the null hypersurface history of the wave front and thus violating the concept of an 
isolated source. A useful example with which to illustrate this is to consider the rela- 
tivistic motion of a Reissner-Nordstrom particle moving in external electromagnetic 
and gravitational fields. 



2 Reissner-Nordstrom Particle 

We construct here a model of a Reissner-Nordstrom particle of small mass m = 0\ 
and small charge e = 0\ moving in external vacuum gravitational and electromag- 
netic fields. In this case the background space-time is a general solution of the vacuum 
Einstein-Maxwell field equations. We shall require a knowledge of this space-time, 
and the Maxwell field, in the neighborhood of an arbitrary time-like world line r = 0 
(say). The charged mass will then be introduced as a perturbation of this space-time, 
which is singular on this world line in the background space-time (in similar fashion 
to (4) in the space-time with line element (3)), in such a way that the perturbed space- 
time is, for small values of r, predominantly the Reissner-Nordstrom space-time and 
Maxwell field. Since we have indicated in Sect. 1 that we shall make use of the field 
equations along with conditions on the wave fronts of possible radiation (electro- 
magnetic and/or gravitational in the current example) produced by the motion of our 
particle, and since the histories of wave fronts are null hypersurfaces in space-time, 
we shall work in the background space-time and in the perturbed background space- 
time in a coordinate system based on a family of null hypersurfaces. Thus we begin 



by writing the line element of the background space-time as 

ds^ = + 2 ^^ # = ( 12 ) 

with the 1 -forms and d^ given by 

d^ = r p~^{e^ cosh f3dx e~^ sinh f3dy -\- a du), (13) 

d^ — r ^mhpdx -\- e~^ ^ dy -\-bdu), (14) 

d^ = dr -\ — du, (15) 

2 

d"^ = du. (16) 



The constants pab = Qba constitute the components of the metric tensor on the tetrad 
defined by the basis 1 -forms. Tetrad indices will be denoted by the early letters of the 
alphabet a, b, c, , and tetrad indices will be lowered with pab and raised with p^^ 
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where g^^gtc = element is completely general in that it involves six 

functions p, a, P, a, b, c each of the four coordinates x, y, r, u. It is thus equivalent 
to line elements constructed by Sachs [11] and by Newman and Unti [12]. It was 
constructed in [6] for the study of gravitational radiation from bounded sources and 
was specifically designed to have the Robinson-Trautman [13, 14] line elements 
appear as a convenient special case (got by simply putting a = P = 0). This latter 
property also makes it convenient in the current context. We shall take r = 0 to be 
the equation of an arbitrary time-like world line in the space-time with line element 
(12). The space-time in the neighborhood of this world line has the well-known Fermi 
property (see, for example, [15, 16]) that there exist coordinates in terms of which 
the metric tensor components satisfy 

gtj = ruj + 0{r^). (17) 

In view of the line element (5) we can implement this property in the context of the 
line element (12) by expanding the six functions appearing there in powers of r as 
follows: 



p = Foil + qir^ + q3r^ ), ( 18 ) 

a = tt2 + tt3 H , (19) 

P = p2r^+P3r^ + --- , ( 20 ) 

a=a\r-\-a 2 r +•••, (21) 

h = h\r -\- h 2 r^ ’ , (22) 

c = CQ -\- c\r -\- C 2 r^ -\ , (23) 



where the coefficients of the powers of r are functions oix,y,u. Following (5) the 
function Pq here is given by (6) and 



and 



Co = 1 = A log Pq with A = Fq 



/ \ 

dy^} ’ 



(24) 



c\ = -2/zo, 



(25) 



with /zo given by (7). The operator A is the Laplacian on the unit 2-sphere. The rela- 
tionship between the rectangular Cartesian coordinates and time and the coor- 
dinates x,y,r,u near the arbitrary time-like world line r = 0 (<^ = w\u), 

introduced in Sect. 1) is given by (see, for example, [17]) 

X^ = (u) r k\ (26) 



neglecting O(r^) -terms. 
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At this point in the discussion we should note some formulas associated with 
Minkowskian space-time with line element (5) or, equivalently, with metric tensor 
components r]ij in coordinates . A useful basis of 1 -forms suggested by (5) is 

= r Pq^cIx, uP^ = r = Jr -h — ho r^ du, = du. (27) 

When expressed in terms of the coordinates using the transformation (26) these 
read 



1 9ki ■ 

^ = -Po^dX^ = -uji, 
ox 

2 T i 

u? = -P^^dX' = -UJ2, 
ay 



Ul)' 






u>‘^ = ki dX‘ = u>3. 



dX‘ = L04, 



(28) 

(29) 

(30) 

(31) 



In coordinates X’ the vector fields with components v\k‘ , dk’’ jdx , dk' jdy form a 
basis and it is useful to express the metric tensor components on this basis as 



2 dki dk‘ dkj 






-h y — yy. 



(32) 



Also the second partial derivatives of y play a key role in the calculations and when 
they are expressed on this basis we have 



d^y 



= - k') - ^(log Po) ^ + ^(log Po) 



dx^ 

d^y 



dx ' 



dy d 

dx 9y ' 



dy 

dy 



(33) 



dy 

d^y 



2 = Pq^O - k') + ^(log Po) ^ - ^(log Po) (34) 



dx 



dx dy 



dy 



d dy d dy 

dxdy dy dx dx dy 



(35) 



As a simple indication of the usefulness of these formulas we notice that if we add 
(33) and (34) we obtain 

Ay +2y =2v\ (36) 



with the operator A defined in (24). Taking the scalar product, with respect to the 
Minkowskian metric tensor r]ij, of this equation with the 4- acceleration y yields 



Aho 2ho = 0, 



(37) 
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with h{) given by (7). This demonstrates that /zq is an / = 1 spherical harmonic. A 
spherical harmonic Q of order I is a smooth solution, for — oo < x, y < +oo, of the 
equation AQ + / (/ + 1)2 = 0 for / = 0, 1, 2, . . . 

Returning now to the construction of the background Einstein-Maxwell field, the 
potential 1-form leading to the background electromagnetic field is given by 

A = Ldx + Mdy + Kdu, (38) 

where L, M, K wcq functions of x, y, r, u. We have used the freedom to add an exact 
differential (i.e. a gauge transformation) to remove one component of this 1-form 
for convenience. The functions appearing in (38) will be expanded in powers of r 
in such a way that the resulting Maxwell 2-form F = dA, the exterior derivative 
of (38), is non-singular at r = 0, since this will be the external Maxwell field. The 
simplest expansions to achieve this are: 

L = L2 + L3 + • ■ ■ , (39) 

M = M 2 + Ms + • • • , (40) 

K = rKi+r^K2 + -- - , (41) 

with the coefficients of the powers of r functions of x,y,u. The Maxwell 2-form is 
given by 

F = JA = (42) 

where Fab = ~Fba are the components of the 2-form on the tetrad defined by the 
basis 1 -forms (13)-(16). The leading terms, in powers of r, of the tetrad components 
Fab which will interest us are, in light of the expansions (18)-(23) and (39)-(41), 

Fi3 = -2Ro^2 + 0(r), F23 = -2RoM2 + 0(r), F^^ = KiFO{r). (43) 

Thus on r = 0 we can replace the basis (13)-(16) by the Minkowskian counterparts 

(28)-(3 1) and the coordinates x , y , r, w by the coordinates following (26) to arrive 
at 



1 dk ^ 1 dk ^ 

L2 = -Fij{u)F M2 = -Fij(u)F Ki = Fij(u)F yj, (44) 

where Fij (u) = —Fji (u) are the components of the external Maxwell tensor on the 
world line r = 0 calculated in the coordinates . With given by (8) above we see 
that now the dependence of the functions L 2 , M 2 , K\ on x, y is explicitly known. 
We can therefore use the useful formulas (33)-(35) to verify that Maxwell’s vacuum 
field equations, = 0, where the star denotes the Hodge dual of the 2-form F, 
are satisfied by L2, M2, K\ . The relevant equations obtained from the leading terms, 
in powers of r, in Maxwell’s equations are 
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(45) 



and 



dKi 

dx 





<dM2 




(^0 \ 


dx 


9)' /I 


\ p^ 1 


/3M2 


dl2\] 


1 ^0 1 


i, dx 


dy J\ 



0 , 



0 . 



(46) 



(47) 



We see that the second equation in (45) is a consequence of (46) and (47) and the 
two equations in (45) imply that is an / = 1 spherical harmonic since 



A^i +2^1 = 0. 



(48) 



This latter equation can, of course, be verified directly using the last of (44) and the 
formulas (33)-(35). 

The analogue, for the gravitational field, of the Maxwell 2-form with tetrad com- 
ponents Fab given by (43) is the Weyl conformal curvature tensor with tetrad com- 
ponents Cabcd- The components which will be of interest to us in the neighborhood 
of the world line r = 0 are given by 



Ci 3 i 3 + iCi323 = 6 (a 2 + if32) + 0(r) 



and 



C 343 I + ^’C3432 = 



3 1 
-pr^ 
2 



a\ H- ib\ -h 4Pq 



0(r), 


(49) 


^1+OM. 

dC 1 


(50) 



where C = x-\-iy and a bar denotes complex conjugation. These must be examined in 
conjunction with the Einstein-Maxwell vacuum field equations for the background 
space-time: 



Pab — 2 



PcaP b ^QabPcdP 



cd 



— 2 Eab’) 



(51) 



where Rab are the tetrad components of the Ricci tensor of the background space- 
time, Qab are the tetrad components of the metric tensor and, as indicated following 
(16), tetrad indices are raised using the components of the inverse of the matrix 
with components gab and Eab = Eba are the tetrad components of the electromag- 
netic energy-momentum tensor. To satisfy the field equation R 33 = 2£’33 -h 0(r) we 
find that 

q2 = \ Pi (l\ + Ml) . (52) 



This can be rewritten, using L 2 and M 2 given by (44) and the expression (32) for the 
components of the inverse of the Minkowskian metric tensor, simply as 
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qi = -\FPi{u)Fpj{u)knK (53) 

o 

Here Fpj {u) are the components of the Maxwell tensor on the world line r = 0 in 
coordinates while F^iin) = Fji(u). Using this expression for q 2 in (50) and 
the basis (28)-(31), in similar fashion to the passage from (43) to (44) above, we 
conclude from (49) and (50) that 



«2 = Cijkliu) 



dk^ 
dx ’ 



p2 = \piCiju{u)k^ 



dy dy 

dx dy ’ 



(54) 



and 



ai = ^ Pi (Cijkt (u) k' yj + FPi (u) Fpj {u) k' , (55) 

2 / dk!^ dk ^ \ 

b\ = -Pi \ Cijki{u) k’ PpjW k' —J , (56) 



where Cijuiu) are the components of the Weyl tensor of the background space-time 
calculated on the world line r = 0 in the coordinates . With Pq given by (6) and 
y by (8), we see that the dependence of q 2 in (53), a 2 and (32 in (54), and a\ and b\ 
in (55) and (56) on the coordinates v, y is explicitly known. The Einstein-Maxwell 
vacuum field equations R \2 = 2 £’i 2 + 0 (r) and 7?n — R 22 = 2(£’n — E 22 ) + 0 (r) 
yield the pair of real field equations incorporated in the single complex equation: 



2 ( q (2 + ipi) = 



d 

dC 



a\ -h ib\ H- 4 Pq 



dq2 

dC 



(57) 



The pair of real field equations R 13 = 2£’i3 + 0(r) and R 23 = 2 E 23 + 0(r) can be 
written as the complex equation: 



ai + ih +4 Pi ^=2 Pi ^{Po~^(a 2 + */?2)}. (58) 

d( oQ 

The field equations (57) and (58) must be satisfied by ^ 2 , <^ 2 , given by (53)- 

(56). This important check can be carried out using the useful formulas (33)-(35). 
We are now ready to introduce the Reissner-Nordstrom particle as a perturbation of 
this background space-time and Maxwell field. 

The line element of the perturbed space-time is given by 



ds^ = -(t9‘)2 - (i)2)2 



(59) 
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with the 1 -forms '3^ ,'3^, and given by 

= r p~^ cosh ^ dx + e~^ sinh^dy -\-ddu), (60) 

= r p~^ (e^ sinh P dx + e~^ cosh Pdy + bdu), (61) 

= dr -\ — du, (62) 

2 

¥ = du. (63) 



To achieve our aim of having a predominantly Reissner-Nordstrom field for small 
values of r we take the functions appearing here to have the following expansions in 
powers of r: 



p = Pq{\ + qir^ + )- 

a = C(2r^ + + ■ ■ ■ , 

/3 = /32 + /3s H , 

fl-i ^ , ,2 

a = h (20 + <31 r -h (22 r + • • • , 

r 



b = + ^0 + ^ + ^2 ^^ + • 

r 

^2 



^ 2 
yL 



2{m + 2/_i) 



+ CO + Cl r + C2 r + ■ 



(64) 

(65) 

( 66 ) 

(67) 

( 68 ) 

(69) 



The coefficients of the various powers of r here are functions of x,y,u. The hatted 
functions differ from their background values by C>i -terms. Thus in particular a_i = 
Oi, «0 = 0\, b-i = O\,bo = 0\, but /_i = C>2- The perturbed potential 1-form 

A = L dx + M dy + k du, (70) 



is predominantly the Lienard-Wiechert 1-form (= c (r ^ — ho) du) up to a gauge 
term so that 



L = r^ L2 + rH3 + --- , (71) 

M = M2+r^ M3 , (72) 

K = - -eho + r Ki+r^ K 2 + --- , (73) 

r 

where again the coefficients of the powers of r here are functions of x,y,u which 
differ from their background values by Oi -terms and K-\ = O 2 . The expansions 
(64)-(69) and (7 1)-(73) appear to us to be the minimal assumptions necessary to have 
a small charged mass with gravitational and electromagnetic fields predominantly 
those of a Reissner-Nordstrom particle for small values of r. They certainly restrict 
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the model of such a particle moving in external gravitational and electromagnetic 
fields and their generalization or otherwise is a topic for further study. 

When the metric tensor given by (59)-(63) and the potential 1-form (70), together 
with the expansions (64)-(69) and (71)-(73), are substituted into Maxwell’s vacuum 
field equations and the Einstein-Maxwell vacuum field equations, and all terms are 
gathered on the left hand side of each equation with zero on the right hand side, we 
find a finite number of terms involving inverse powers of r and an infinite number of 
terms involving positive (or zero) powers of r. In an ideal world we would equate the 
coefficients of each of these powers to zero and from the resulting equations derive 
the coefficients in the expansions (64)-(69) and (71)-(73). However the reality is 
that we can at best make the coefficients small in terms of m and e. How small 
depends upon how accurately we require the equations of motion. Also the number 
of coefficients we require in the expansions (64)-(69) and (71)-(73) depends upon 
how accurately we require the equations of motion. For the purpose of the present 
illustration we shall calculate the equations of motion in first approximation and thus 
with an O 2 -error. The null hypersurfaces u = constant in the space-time with line 
element (59) are the histories of possible wave fronts of radiation (electromagnetic 
and/or gravitational) produced by the motion of the Reissner-Nordstrom particle. 
For small values of r the degenerate metric on these hypersurfaces is given by the 
line element 

dsl = -r^p^^idx^ + dy^), (74) 

where Pq differs from its background value Pq given in (6) by Oi -terms and thus 
can be written 

Po = Ro(1 + Gi + 02), (75) 

with Qi(x,y,u) = 0\. We will assume that the line elements (74) are smooth 
deformations of the line element of a 2-sphere. This means that Gi is a well behaved 
function for— oo <x,y < -hoo and thus free of singularities in x,y. Now the field 
equations yield 



a-\ = —Ae PqL 2 -h O 2 = O 2 , 


(76) 


b-i = —4e PqM 2 + O 2 = 0\, 


(77) 


cq = l-fAGi ~h 2Qi + SeFijk^ -|- O 2 , 


(78) 


;^A(A2i -h 22i) = 6maik^ - 6eFijFv^ -h O 2 . 


(79) 



Both terms on the right hand side of (79) are / = 1 spherical harmonics and thus 
(79) can be easily integrated (discarding the solution of the / = 0 spherical harmonic 
equation which is singular for infinite values of x and y) to read 



A2i + 22i = 6maik^ — 6eFijk^v^ -h A(u) -h O 2 , 



(80) 
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where A(u) = 0\ is arbitrary (an I = 0 spherical harmonic). We note that spherical 
harmonics corresponding to / = 0 or / = 1 in the perturbation Q\ in (75) are trivial, 
in the sense that the 2- sphere remains a 2- sphere under these perturbations, and can 
be neglected. Since the first two terms in (80) are both I = 1 spherical harmonics 
the solution Q\ will necessarily have a singularity in v, y (a directional singularity) 
unless they combine to vanish or be at most small of second order. Thus we must 
have 

maik^ = 6eFijldv^ + O2, (81) 

for all possible values of and thus 



mai = eFijV^ + O2. 



(82) 



We have arrived at the Lorentz equations of motion in first approximation. 

The example above serves to illustrate our approach. If it is continued to the next 
order of approximation the equations of motion which emerge are [8] 

4 2 

mat = eFijV^ H — F^ j^FpiV-^ H — -\~ e^Ti + O3, (83) 

7 3 ^ 3 

where h^j = 6^j — vj is the projection tensor and the dot denotes differentiation 
with respect to proper time u. The second term on the right hand side here is an 
O2 -correction to the external 4-force. The third term on the right hand side is the 
electromagnetic radiation reaction 4-force. The fourth term on the right hand side is 
a tail term given by an integral with respect to proper time u from —00 to the current 
proper time w of a vector whose components are proportional to the external Maxwell 
field Fij (u) and involve functions of u of integration and a pair of space-like vectors, 
which occur naturally, defined along the world line r = 0 in the background space- 
time. When ^ = 0 we obtain the equations of motion of a Schwarzschild particle in 
second approximation given by the approximate time-like geodesic equations 



mat = 03 . (84) 

There is a formal similarity between the equations of motion (83) and equations 
of motion derived by DeWitt and Brehme [18]. However DeWitt and Brehme have 
removed an infinite term from their equations of motion and have considered a 
scenario (the world line of a charged test particle in a curved space-time, and hence 
utilize only Maxwell’s equations on a curved space-time) which is different to that 
constructed here. 

Finally, so that the reader is left in no doubt as to the procedure involved in 
the passage from (80) to (82), we can give a simplified example of the procedure. 
If A = cosO for 0 < 6> < 7T (or equivalently for— 1 < A < -hi) consider a 
function / = /(A) satisfying the inhomogeneous / = 1 Legendre equation (i.e. 
the inhomogeneous I = 1 spherical harmonic equation satisfied by a function of the 
single variable A) with an / = 1 Legendre polynomial on the right hand side: 
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A/ + 2/^ ^|(1-A2)^I +2/ = fiPi(A), 



(85) 



where t\ is a constant. For this equation to have a solution which is non-singular at 
A = ibl (which are directional singularities) we must have t\ = 0 (analogous to 
(81)). This easily follows from consideration of the general solution 



where t 2 , are constants and Q \ (A) is the Legendre function corresponding to / = 1 . 
Of course the required directional singularity-free solution also requires us to take 
^3 = 0 here. 



3 Kerr Particle 

We outline here the application of our technique to the construction of a model of 
a spinning particle, or Kerr particle, moving in an external vacuum gravitational 
field, and the derivation of its equations of motion. This work, carried out by Shinpei 
Ogawa and the author, is reported in [10]. Although we consider only the equations 
of motion in first approximation here, the calculations are more intricate than those 
necessary for the study of the Reissner-Nordstrom particle. Consequently we must 
refer the interested reader to [10] for further details. 

The fundamental aspects of our technique are the construction of a background 
space-time with an arbitrary time-like world line in the neighborhood of which the 
space-time is fiat, following from the Fermi property, and then the introduction of 
the particle of interest as a perturbation of this space-time. In the fiat neighborhood 
of the world line in the background space-time we wrote the line element in the 
form (5) which introduces the 4- velocity and 4-acceleration of the world line into 
the line element. To consider a particle with spin in this context we introduce the 
spin variables (what will become the three independent components of the angular 
momentum per unit mass of the particle) into the line element of Minkowskian 
space-time to accompany the 4-velocity and 4-acceleration already present. We do 
this in such a way that if =0 then the form of the line-element of Minkowskian 
space-time coincides with the Kerr line element, with three components of angular 
momentum per unit mass, in the special case in which the mass m = 0. The latter form 
of the Kerr line element can be found in ([19], p. 37). Let {u) be the components 
of the spin vector in coordinates X\ defined along the arbitrary time-like world 
line (u) to be everywhere orthogonal to the 4-velocity (u) and Fermi 

transported, thus. 




( 86 ) 



a A 

Vi = 0 and — = —(ajS-^)s , 



du 



(87) 
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respectively. The equivalent spin tensor is defined by 

^ij — Qjki V s = ~Sji, (88) 

where eiju are the components of the Levi-Civita permutation tensor in coordinates 
. We introduce s^ into the Minkowskian line element by replacing (26) by 

r =w^{u) + re + P^y—Fy-—F^y (89) 

where 

F = s^ki, (90) 

and the subscripts on F denote partial differentiation with respect to v and y . We note 
that Pq and are given by (6) and (8). By (87) and the useful formulas (33)-(35) 
we easily find that F is an / = 1 spherical harmonic. Also we can rewrite (89) in the 
general form of (26) as 



= w^{u)^-rK^ with 



rfJ + - s^^ 
i r 



) 



(91) 



showing that for large r in Minkowskian space-time only differs from by an 
infinitesimal Lorentz transformation generated by the spin tensor. The Minkowskian 
line element is given by ds^ = rnj dX^ dX^ with X^ given now by (89). When 
X^ = w^(u) (no longer corresponding to r = 0) is a geodesic we can choose = 6^^ 
and the resulting line element is 



dsQ = — (r^ + F^)pq ^(dx^ -h dy^) + 2JE 



dr H — (du 
2 



Fydx -h Fxdy) , 

(92) 



with po given by (2) and 



JE = du — Fydx -h Fxdy. (93) 

We note that the latter is not an exact differential unless the spin vanishes. The Kerr 
solution with the three components of angular momentum (ms^ , ms^, ms^) (since 
now s^ = 0) is 

ds^ = dsQ (94) 

If the world line X^ = w^{u) not a time-like geodesic then the Minkowskian 
line element is algebraically more complicated than (92). For simplicity we shall 
henceforth neglect spin-spin terms and assume that the 4-acceleration is proportional 
to the spin on the basis that we expect geodesic motion if the spin vanishes. Now the 
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Minkowskian part of the background line element, when the world-line = w^u) 
is not necessarily a time-like geodesic, is given by 



dsn — —r P( 



2 d-2 



2P^ F, 



2P^ F, 



dx^ + dy^ H ^ dx du ^ dy du 



+ 2 Fy du dx — 2 Fx du dy + {I — 2 ho r) du^. 



2dr JE 

(95) 



With this preparation we write the line element of the general background space-time 
as 



^ {e^ cosh (3dx -\- e ^ sinh jSdy + a JE)^ 


sinh/3(/x + e “cosh/3Jy + ZjJS)^ 


■ 3-2dr + cdYi, 




(96) 


p = Po {\ + q2r^ + 3 ), 


(97) 


2 

a = a\r + a 2 r + • • • , 


(98) 


(3 = f3\r + (32r^ 3 , 


(99) 


p 2 Fy a-i 

a — ^ -h -\- cio -\- u\ r ’ 

yL y 


(100) 


Pi Fx b-i 

b = + 3-b(,3-b\r 3- 

yZ y 


(101) 


c = coF C\r 3 


(102) 



We now require this background line element to be a solution of Einstein’s vacuum 
field equations 

Rab = 0. (103) 



Equating to zero the powers of r in the Ricci tensor components Rab we find the 
following expressions for the coefficients of the powers of r in the expansions (97)- 
( 102 ): 

qi = 0 = q 2 , a\ = AF (32, f3\ = —4Fa2, (104) 



1 dk^ dk^ 

a2 = -P^ Rijktiu) + 0(F), (105) 

132 = -Pi Rijkiiu) —y—k^ + 0{F). 



(106) 
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Also a-i = 0 = b-\ and 

ao = -lP^{a2Fy - bo = -lP^{a2F, + p2Fy), (107) 



2 9 i i F dk^ 




ai = -P^ Rijkiiu) k’ vn’^ — F 0(F), 


(108) 


2 

bi = -P^ Rijkiiu) k‘ vj k’^^F OiF), 


(109) 



and Co = 1 while 

Cl = -2/10 - FPi (J-(p-^ai) - ^(P^^bi)^+5{aiFy-biF^). (110) 

The 0(F)-terms not calculated here will get multiplied by F, as in (104), (107) and 
(110) and will therefore not contribute to the end result since we are systematically 
neglecting spin-spin terms. In these formulas Rijkiiu) are the components of the 
Riemann curvature tensor in coordinates calculated on the world-line = w^{u). 

As well as satisfying the algebraic symmetries of the Riemann tensor they satisfy 
Einstein’s vacuum field equations rj^^Rijuiu) = 0 on the world-line X^ = w^(u). 

We introduce the Kerr particle of small mass m = Oi as a perturbation, which is 
predominantly the Kerr field for small values of r, of this background vacuum space- 
time. The simplest way to achieve this appears to be the expansions (remembering 
that we make no claim to uniqueness; see the comments following (73) above): 






2 --2 
— r p 



(c^ cosh /3 Jv -h c ^ sinh/3 Jy -h a JE)^ 



-h(c^ sinh/7 Jv -h c ^ cosh/7 Jy -h ^ JE)^ 



-|- 2 dr dXj -\- c dXj^ , 



( 111 ) 



with 



p = Po{\ F q2V^ F ^ ). (H2) 

a = air F oi2r^ F ■ • • , (H3) 

(3 = (3ir F P2r^ , (114) 

^ P} Fy 0.-1 ^ , 

a = — ^ 1 \- -\- ci\ r ' , (115) 

r 



Equations of Motion of Schwarzschild, Reissner-Nordstrom and Kerr Particles 



281 



P}F^ b-i ^ 

b = 1 \-bo-\-b\r , 

y. 

^ 2m 

c = h CO + Cl r H 

r 



(116) 

(117) 



As always the hatted functions of x,y,u differ from their background values by 
Oi -terms. In addition Pq again has the form (75) involving the first order function 
Qi(x, y, u). Now we must impose the vacuum field equations to be satisfied by the 
metric given via the line element (111) with sufficient accuracy to enable us to derive 
the equations of motion for the world line = w^(u) in the background space- 
time, in first approximation. The vacuum field equations provide us with a set of 
equations which parallel (79)-(81) in the Reissner-Nordstrom example. In addition 
to qi = 02,q2 = O 2 we find that 



ao = -7PQ(a2Fy ^ 2 Fx) 2ma\ -h 0{mF) -h O 2 , (118) 

foo = -1PI{ol2F^ + ^Fy) - 2mbi + 0{mF) + O 2 , (119) 

a-\ — —AmFb\ + O 2 , b-\ = AmF a\ -h O 2 , (120) 

and 

-%m {axFy - biF^) F O 2 . (121) 

In the remaining field equation, which yields the analogue of (79) and thence the 
analogue of (80), the functions uq and b^ are multiplied by m and so the uncalculated 
terms 0(mF) in (118) and (119) will not contribute to the equations of motion in 
first approximation (i.e. neglecting 02-terms). The equation we obtain, which is the 
analogue of (80), is 



(A -|-2)(2i -|-2m/i +m/ 2 ) = 6maik^ —6mRijki(u)k^ A- A(u) A- O 2 , (122) 



where A(u) = Oi is a function of integration as in (80) and J\ and J 2 are spin- 
curvature terms given by 



= PqF Rijki{u)k‘v 



j dk^ dk^ 
dy dx 



+ ^Rijki{u)k’- 



which is an / = 3 spherical harmonic and 



. / Q]^l 

h = Pq RijklWSmV' y 



dk^ dk"‘ \ 
dy dx / ’ 



(123) 



( 124 ) 
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which is an / = 2 spherical harmonic, results which are obtained using the useful 
formulas (33)-(35). The first two terms on the right hand side of (122) are / = 1 
spherical harmonics and so, just as in the case of (80), the solution Q\ will be free 
of directional singularities provided the sum of these terms is zero or at most small 
of second order, i.e. 



m at = m Rijuiu) + O 2 , (125) 

for all possible and thus we arrive at the equations of motion of the Kerr particle 
in first approximation: 



mat = m Rijkiiu) + O 2 . (126) 

The algebraic form of the right hand side of these equations is, perhaps, what one 
would expect. The numerical factor of unity distinguishes these equations of motion 
from those of Mathisson-Papapetrou for a spinning test particle. We have here a 
model of a Kerr particle and it is manifestly not a test particle. The rest mass m is 
already small and making it smaller does not make it a test particle since so long as 
m / 0 its presence perturbs the background space-time (96). However in the present 
context we can say something about spinning test particles and this is the topic we 
turn to now. 



4 Spinning Test Particle 

Since we are now concerned with test particles moving in vacuum gravitational 
fields we will concentrate on what we called background space-times above. We first 
consider the non-spinning test particle whose world line, according to the geodesic 
hypothesis, is a time-like geodesic. With the geodesic hypothesis accepted for a non- 
spinning test particle it is relatively easy to devise a strategy, using the formalism of 
this paper, which will lead us to the equations of motion of a spinning test particle. 
As in Sect. 3 we will neglect spin-spin terms, but this restriction can be relaxed (and 
indeed has been by Florian Bolgar in his Internship Report to the Ecole Normale 
Superieure (2012)). The background vacuum space-time with the spin vector = 0 
has line element given by (96)-(102) with F = 0 and the coefficients of the powers 
of r are given by (104)-(110) with F = 0. Thus in coordinates = (v, y, r, u), 
with i = 1, 2, 3, 4, the only coordinate component of the metric tensor involving the 
4-acceleration of the world line (u) is 



544 = 1 - 2rho + O(r^). 



(127) 
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The coefficient of —2r on the right hand side here is the / = 1 spherical harmonic 
/zQ, which vanishes if and only if = 0. In this case the world line {u) is 

a time-like geodesic and is thus the history of a test particle moving in the vacuum 
gravitational field modeled by the space-time with line element (96) with F = 0. 
Now we consider a particle with spin 7 ^ 0 having world line {u) in the 

space-time with line element (96) with F 7 ^ 0. Now (127) is replaced by 

544 = 1 - 2r (ho - ^Rijki(u)k‘vjs’^‘ + + 6>(r2), (128) 

with J 2 given by (124). The first two terms in the coefficient of — 2r here are / = 1 
spherical harmonics while the third term /2 is an / = 2 spherical harmonic. If the 
equations of motion of a spinning test particle are obtained in the same way as those 
of a non-spinning test particle then equating to zero the / = 1 terms in the coefficient 
of — 2r in (128) results in 

a; k’ = ho = ^Rijki(u) k‘ (129) 

for all possible values of and thus the equations of motion of a spinning test 
particle, neglecting spin-spin terms, are 

ai = ]^Rijki{u)vh’^\ (130) 

in agreement with Mathisson [20] and Papapetrou [21]. 



5 Discussion 

Perhaps the most striking aspect of the technique presented here for extracting equa- 
tions of motion from field equations is the absence of infinities arising in the cal- 
culations, either in the form of infinite self energy of the Dirac type [18] or in the 
form of divergent integrals. We have already indicated following (84) above that 
in the case of the DeWitt and Brehme [18] work this may be due to the fact that 
they have considered a different problem to the one we have described in Sect. 2. 
Much work in recent years has been done under the general heading of “the self 
force problem” and, fortunately, this work is well represented in the current volume, 
offering the interested student a direct comparison with our work and the challenge 
of connecting the different points of view. 



284 



RA. Hogan 



References 



1. I. Robinson, J.R. Robinson, in General Relativity, ed. by L. O’Raifeartaigh (Clarendon Press, 
Oxford, 1972), p. 151 

2. RA. Hogan, M. Imaeda, Equations of motion in linearized gravity: I uniform acceleration. J. 
Phys. A: Math. Gen. 12 , 1051 (1979) 

3. RA. Hogan, M. Imaeda, Equations of motion in linearized gravity: II run-away sources. J. 
Phys. A: Math. Gen. 12 , 1061 (1979) 

4. RA. Hogan, M. Imaeda, On the motion of sources of some Robinson-Trautman fields. J. Phys. 
A: Math. Gen. 12 , 1071 (1979) 

5. RA. Hogan, G.M. O’Brien, On the motion of rotating bodies in linearized gravity. Phys. Lett. 
A 72 , 395 (1979) 

6. RA. Hogan, A. Trautman, in Gravitation and Geometry, ed. by W. Rindler, A. Trautman 
(Bibliopolis, Naples, 1987), p. 215 

7. RA. Hogan, I. Robinson, The motion of charged particles in general relativity. Pound. Phys. 
15 , 617 (1985) 

8. T. Putamase, RA. Hogan, Y. Itoh, Equations of motion in general relativity of a small charged 
black hole. Phys. Rev. D 78, 104014-1 (2008) 

9. H. Asada, T. Putamase, RA. Hogan, Equations of Motion in General Relativity (Oxford Uni- 
versity Press, Oxford, 2011) 

10. C. Barrabes, P. A. Hogan, Advanced General Relativity: Gravity Waves, Spinning Particles, and 
Black Holes (Oxford University Press, Oxford, 2013) 

11. R.K. Sachs, Gravitational waves in general relativity VIII. Proc. R. Soc. (London) A 270 , 103 
(1962) 

12. E.T. Newman, T.W.J. Unti, Behavior of asymptotically fiat empty spaces. J. Math. Phys. (N.Y.) 

3 , 891 (1962) 

13. I. Robinson, A. Trautman, Spherical gravitational waves. Phys. Rev. Lett. 4, 431 (1960) 

14. I. Robinson, A. Trautman, Some spherical gravitational waves in general relativity. Proc. R. 
Soc. (London) A 265 , 463 (1962) 

15. L. O’Raifeartaigh, Permi coordinates. Proc. R. Irish Acad. Sect. A 59, 15 (1958) 

16. L. O’Raifeartaigh, Riemannian spaces of N dimensions which contain Permi subspaces of N-1 
dimensions. Proc. R. Irish Acad. Sect. A 62 , 63 (1962) 

17. E.T. Newman, T.W.J. Unti, A class of null fiat-space coordinate systems. J. Math. Phys. (N.Y.) 

4, 1467 (1963) 

18. B.S. DeWitt, R.W. Brehme, Radiation damping in a gravitational field. Ann. Phys. (N.Y.) 9 , 
220(1960) 

19. C. Barrabes, RA. Hogan, Singular Null Hypersurfaces in General Relativity (World Scientific, 
Singapore, 2003) 

20. M. Mathisson, New mechanics of material systems. Acta Physica Polonica 6, 163 (1937) 

21. A. Papapetrou, Spinning test-particles in general relativity I. Proc. Roy. Soc. (London) A 209 , 
248 (1951) 



From Singularities of Fields to Equations 
of Particles Motion 



Y, Itin 



Abstract Due to the well known Einstein’s proposition, the non-linearity of the 
GR field equations allows to derive, from the singularities of the field, the geodesic 
principle i.e., the equations of motion of massive pointwise particles. In this paper, 
we illustrate how this construction can be realized explicitly in a simple case of a 
non-linear scalar field model. For a field singular at one point (a timelike curve in 
4D description), we derive the inertial motion law. For a field with two singularities 
(two disjoint timelike curves), we obtain, in the lowest approximation, the second 
law of non-relativistic dynamics together with the proper expression of Newton’s law 
of attraction. The ordinary used method in such type of derivation is the integration 
over a tube near the singular line. Instead, we are working with the singular terms 
themselves. We compare the terms of the field equation that have the same order of 
divergence. The dynamical equation is derived as the relation between the coefficients 
of two leading singular terms — the agent of inertia and the agent of interaction. 



1 Introduction 

It is usually declared that general relativity (GR) is unique and completely differ- 
ent from other classical field theories in its treatment of the particle motion. For a 
comparison, in linear classical field theories, such as Maxwell’s electrodynamics, the 
equations of motion of the electric charges can and must be postulated independently 
from the field equations. It is in a contrast to GR where, due to Bianchi identities, 
the trajectories of sources cannot be assumed completely arbitrary and thus must 
be treated as a type of consequence of the field equations. The ordinary assertion is 
that this difference is related to the fact that Maxwell’s equations are linear while 
Einstein’s equations are non-linear. Namely the non-linearity forbids the unphysical 
solutions and thus turns out to be a positive feature of a good physical theory. 
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It was stressed by Einstein, Infeld, Fock and others, that the treatment of the 
equations of motion is one of the key achievement of GR. The original Einstein’s 
proposal can be formulated as follows: 

The equations of motion of point-wise particles is completely determined by the 
source -free field equations and directly derived from them. 

In other words, the assertion is that it is possible to prove the implication 

Rij =0 UiDjU^ = 0 . ( 1 ) 

Here, the first equation is the source-free Einstein’s equation. The second equation 
is the geodesic one. In the first approximation, being multiplied on the mass of the 
test particle, it is expressed in the form of the second law of dynamics, ma = F. 

Observe that these two systems of equations are essentially different one from 
the other in their mathematical structure. In the left hand side of (1), we have the 
GR field equations that are combined into a system of 10 partial differential equa- 
tions for 10 independent fields gij{x^). In a contrast, the equations of motion in 
the right hand side of (1) are dealing with a tangential vector of a curve in i.e., 
with a vector- function of one real variable u\s). For N particles considered on a 
special-relativistic background, these equations constitute a system of 4N ordinary 
differential equations. In other words one of the problems that two equations in (1) 
are dealing with different sets of variables. 

A relation between these two distinct systems is provided by Einstein ’s model of 
a particle. A point-wise particle is assumed to be represented by a solution of the 
field equation that is singular on some curve — a wordline of the particle. A system 
of two particles is represented by a solution that is singular on two separated curves. 
Certainly it is assumed that the different components of the physical field have the 
same domain and consequently the same singular curves. 

Although, the implication (1) includes two pure geometric covariant equations, 
the proof of it was constructed with different approximation procedures [1-10]. The 
most known approximation procedure is based on the Einstein-Infeld-Hoffmann 
(EIH) equations. They were even extended to high order PPN approximations of the 
A-body problem [11-14], for calculation of the gravitational radiation reaction force 
[15] and for describing the motion of a gyroscope [16]. 

A new type of the proof that has proposed recently, see [17, 18], is based on 
a limiting procedure instead of an approximation scheme. These derivations are 
dealing with a full Einstein’s equations including the energy-momentum tensor in 
the right side. In fact, only the existence of a generic energy-momentum tensor 
satisfying some natural conditions is required. Such consideration, however, is far 
from the original (source-free) Einstein’s proposal. Another type of an exact (non- 
approximated) proofs [19, 20] are based on the notion of distributions and use the 
symmetry group arguments. Also here the full Einstein’s equation is considered and 
the equations of motion are derived, in fact, from the energy-momentum tensor. For 
a complete review of the methods used in different proofs of the geodesic principle 
and on their logical meanings, we refer to [21]. This paper also includes a vast list 
of relevant literature. 
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In the current paper, we will show by elementary examples how, in principle, 
equations of particles motion can be identified into the field equations and how 
they actually can be extracted. To these purposes, we restrict ourselves to a scalar 
field equation (linear and non-linear). Moreover, we are looking for the lowest order 
approximation of the geodesic equation — Newton’s law of dynamics together with 
Newton’s expression of the force of attraction. In other words, instead of discussing 
the implication (1) we will study the implication 

= 0 mx = F. (2) 

Here and in the sequel, the Roman indices denote the components of four-dimensional 
vectors and tensors. The three-dimensional vectors are denoted by bold letters. We 
call the term mx the agent of inertia, the force term F is referred to as the agent of 
interaction. So our problem is to identify these two agents in the field equation and 
to derive the equation between them. 

The paper is organized as follows: In Sect. 2 we consider a linear scalar Lorentz 
invariant field model and derive the inertial motion of its 1 -singular point solution. 
In Sect. 3, the non-inertial motion is discussed. We derive an exact solution with one 
singular point and an approximated solution with two singular points. We show that 
the balance between the singular contributions to the field equation yields, in the first 
order, the Newton law of attraction. 



2 Inertial Motion as Derived from a Field Equation 



First we must decide which class of the field equations is suitable for derivation of the 
equations of motion. In this section, we discuss a scalar field model which singular 
solution represents inertial motion of a single particle. We also make a first attempt 
to describe fields that are singular along curved trajectories. 



2.1 N on-relativistic Scalar Field Model 



As an introductory example, we consider the field equation 



d^(b d^d) d^6 



( 3 ) 



for a scalar function (j){t,Y) = (j){t,x,y,z). This model can be viewed as the New- 
tonian gravity formulated as a field theory. The field equation (3) does not include 
time derivatives thus it cannot yield an agent of inertia. It has however some properties 
that will be instructive for our consideration. 



288 



Y. Itin 



The solution of (3) of the form 

vn / 

</>(r,r) = — , r = ||r|| = +))2 _|_j2 (4) 

r ’ 

is singular at the origin of the coordinates and have a free parameter m of the length 
dimension. When a system of units with Newton’s constant equal to one is chosen, 
we are able to interpret this solution as a model of a point- wise particle localized at 
the origin with the mass m. Since Eq. (3) is independent of the time coordinate, it 
has a more general solution 



r) 



m 

l|r-V’(OII 



(5) 



with an arbitrary vector function Such a solution is defined for every r 7 ^ '0(0 
so it can be viewed as a model of a particle of the mass m moving along the curve 
r = 0(0- Since this curve is arbitrary, the model (3) with the solution (5) does 
not have a lot of physical sense. It is well known that, in Newton’s gravity, the 
arbitrariness of the function 0(0 in the solution (5) is restricted by an additional 
axiom, which is Newton’s law of attraction. Only under these requirements, a proper 
trajectory r = 0(0 is derived. 



2.2 Dynamic Singular Ansatz — a First Attempt 



We will discuss now another example of a field equation. In a close similarity to 
Maxwell’s electrodynamics, we consider a relativistic scalar model 

d^d) 

This equation includes the second order time-derivative, we need it in order to con- 
struct the agent of inertia. 

Similarly to the non-relativistic case, Eq. ( 6 ) also has a static solution 



m 

(h{t,Y) = , 



(7) 



that is singular at the origin. We are looking now for a more generic time-dependent 
solution of Eq. ( 6 ). As in Eq. (5), we can try a generic ansatz 



0(r,r) 



m 

l|r-0(Oir 



( 8 ) 
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This solution, if it exists for some non-trivial function will be defined on a 
set \ {r — -0(0 = 0}. Recall that we treat the exceptional set r = -0(0 as the 
trajectory of a particle of the mass m. For instance, with -0(0 = + \t, Eq. (8) 

must represent a field of a free particle. We will see shortly that it is not a case. 

We substitute (8) into (6). For an arbitrary function we have 



)=0. (9) 

Vllr-V^(0ll/ 

In order to calculate the time derivatives, we first observe an identity that holds for 
an arbitrary vector function a = a(r, r) 



Here and in the sequel, < , > denotes the scalar product of two 3 -dimensional vectors. 
Consequently, the time-derivatives read 



9 / 1 \ <-0(0, r — > 

Vl|r- -0(011/ ||r-0(OII^ 



( 11 ) 



and 



/ 1 \ ^ -||0(f)|p ^ <-0(O,r-0(O>^ 

5O\||r-0(O||/ ||r-0(O||3 ||r-0(OI|5 

( 12 ) 



We rewrite this expression as 



<m0(r),r-0(r)> 3 <-0(0, r - 0(f) -||-0(OlPl|r - 0(OlP 

Uo = ^ h m ^ . 

||r-0(OlP ||r-0(OI|5 

(13) 

In the right hand side of Eq. (13), the first term is exactly what we need as an agent 
of inertia. The additional terms must be equal to zero. It is true, however, only for 
= 0. Indeed, it is easy to check that even in the simplest case of the free motion 
trajectory, = Vo+yt, these terms give an expression which is non zero for v / 0. 
It means that (8) is not a solution of Eq. (6) and we must look for a modification of 
our ansatz. 

There is a principle reason why our simple ansatz (8) cannot make the job. The 
field equation (6) is relativistic invariant, but the ansatz (8) can be considered as be 
obtained by the non-relativistic transformation of the static ansatz. 



290 



Y. Itin 



2.3 Lorentz Transformation of a Static Singular Solution 



Our problem can be formulated now as follows: We are looking for a solution of 
the equation (6) that represents inertial motion of a particle by the singularity of its 
solution. Let us are consider a more complicated ansatz 



0(r, r) 



m 

l|R(^r)|| 



(14) 



with an unknown vector function R{t,r). Our requirements for R{t,r) are as follows : 

(1) The function R{t,r) must be regular in the whole space except for one singular 
curve. 

(2) R(t,r) must approach zero at the space infinity and at the time infinity. 

(3) In the static case ^ = 0, the vector R(t,r) must be equal to (r — r^). 

(4) The free-motion trajectory r(t) = Vo + yt must be a solution of the equation 
R(t, r) = 0. 

The simplest way to derive a suitable expression for R(t,r) is to start with a static 
solution and apply to it the Lorentz transformation with a constant velocity v. Most 
textbooks restrict to the one-dimensional Lorentz transformation. In particular, when 
a reference system {t,x,y,z} moves with a velocity v parallel to the v-axis of the 
rest reference system {L y, z}, the corresponding Lorentz transformation is given 
by 



X = f3(x + vt), y = y, z = z, t = /3{t + vx), (15) 

with the Lorentz parameter 



= 




(16) 



Recall that we use here and in the sequel a system of units with c = 1. Since we 
want to describe a more generic trajectory of a particle, we need Lorentz transforma- 
tion with an arbitrary directed velocity. Observe that two Lorentz’ s transformations 
with orthogonal velocities do not commute, so a general transformation can not be 
generated by a successive application of three orthogonal transformations (relative 
to three axes). The transformation with an arbitrary directed velocity can be derived 
as follows, see [22]. 

Consider a reference system which axes are parallel to the corresponding axes 
of a rest reference system. Let the origin of the first reference system moves with 
an arbitrary directed velocity v. Consider a position vector r directed to an arbitrary 
point. Its projection on the direction of v reads 



PyY = V 



< V, r > 



|2 • 



(17) 
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Fig. 1 The vectors r and f 
are exhibited as the sum of 
their tangential and normal 
parts 




We exhibit the vector r as the sum of its tangential and normal parts (see Fig. 1). 

r = Pyr + N^r, jV^r = r - v ■ (18) 

IMP 

Due to Eq. (15), for a Lorentz transformation directed by v, these two parts trans- 
form as 



P^r = PiPyV -h \t), and N^r = N^r. 
Consequently, the transform of the spatial vectors takes the form 
f = Fyf -h NyV = f3(Pyr -h \t) -h NyV, 

or, explicitly, 

f = r + V (Pt — a < r, V >) . 

Here two Lorentz parameters are 






1 






a 



1-/3 



|2 • 



(19) 



( 20 ) 



( 21 ) 



( 22 ) 



The change of the time coordinate is also governed only by the tangential part of the 
vector r 



i = p(t + llPvrII ||v||) = P(t+ <Pyr,\>), 



(23) 



or, explicitly. 



t = 13 (t <r,\>) . 



(24) 
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Therefore, an arbitrary directed Lorentz transformation takes the form 

t = /3(t+ 

r = r -\-y ((3t — a <r,y>) . ^ 

In the special case of a velocity parallel to the axis x, the relations (25) reduce to the 
ordinary form of the Lorentz transformation (15). 



2.4 A Dynamical Singular Solution 

We consider now an ansatz 



0(L r) 



m 

l|/?(Lr)||’ 



(26) 



where R{t,r) is the Lorentz transformation of the vector (r — r^,) with a constant 
velocity (— v), i.e., 



R(t, r) = (r - r^) - av < v, (r - r^) > ~(3yt. (27) 



Here a and [3 are Lorentz functions of the velocity v given in Eq. (22). 

First we check that (26) indeed a solution of the field equation (6). For the time 
derivatives of the vector /?(L r), we have 

R{t, r) = -py, R(t, r) = 0. (28) 



Consequently, due to (10), the time derivatives of the field are 



dcj) 



= —m/3 



<R,y> 



(29) 



and 



5^0 

dt^ 




<R, y>^ 




(30) 



In order to calculate the spatial derivatives of the field (26), we introduce a set of 
unit vectors e\,e 2 ,e 3 directed along the axes x, y , z respectively. Using Eq. (10) the 
X -derivatives of the vector (27) is written as 



R^ = ei - ay < v, ei > R^^ = 0. 



(31) 
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The second order derivative of the field takes now the form 



Consequently, the three dimensional Laplacian of 0 is expressed as 
A<p=-^{\\R,\\^ + \\Ry\\^ + \\R,\\^) + 



(32) 



m 
3 — ^ 



d<Rx,R>^ + <Ry,R>^ + < Rz, R >^) 



(33) 



Since 



< Rx,R > = < R, e\> —a <\, ei><\, R>, (34) 

we get 

< Rx, R>^ -\- < Ry, R>^ -\- < Rz, R>^ = 

R^ - 2a <\,R>^ +a^\\\\\^ <\,R>^, (35) 



l|/?xll^ + l|/?yll^ + l|/?zlf = 3 - 2a||v||2 +a2||v||^ (36) 

Thus the three dimensional Laplacian of the field (p remains in the form 

A<^= |^(a2||y||2_2a)(3 <y,R>^ -||v||2||/?||2). (37) 

With (30) and (37), we write the full d’Alembertian of the field as 

n4>=^(p^ + 2a-a^\Mp ( 3 < v ,/?> 2 -|| v || 2 ||/?|| 2 ). ( 38 ) 

In the last steep, we use the expressions (22) for the functions a, (3 to derive straight- 
forwardly 



/3^ + 2a-a^\\\\\^ = 0. (39) 

Consequently, the 1 -singular ansatz (26) with (27) is indeed a solution of the linear 
equation □0 = 0. 

Let us check now what is the trajectory of the singular point related to this solution. 
It is encoded implicitly in the equation R(t,r) = 0, i.e.. 



(r — r^,) — a\ <v, (r — r^^)> ~P\t = 0. 



(40) 
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Multiplying both sides of this equation by vector v we obtain a scalar equation 



<v, (r r^)> = 



/3||v|P? 

1 - a||v||2 



= l|v||2r. 



(41) 



We substitute this expression into Eq. (40) and obtain 



r-ro 



(a||v||2+/)) 



\t = \t. 



(42) 



It means that the singular set of the field <p coincides with the inertial trajectory. 
Thus starting with the singular solution of the linear equation □ 0 = 0 we derive the 
inertial motion trajectory. 



3 Non-inertial Motion 

Since it is imposable to get from the linear equation something more than the inertial 
motion, we turn to a non-linear scalar model. Its singular solutions must represent 
particles moving along curved trajectory. We identify the agent of inertia and the agent 
of interaction and derive, in the lowest approximation, Newton’s law of attraction 
between two singularities. 



3.1 Field Ansatz and Agent of Inertia 



We are trying now to extend our results to a non-inertial motion. In general, it can 
be expressed by a curved trajectory of the form r = -h '0(0- In the inertial case, 
= \t with a constant velocity v, we must return to the results from the previous 
section. Thus, in order to model a field of a single particle moving along a curved 
trajectory, we consider the ansatz 



4>{t, r) 



m 

l|/?a,r)||- 



(43) 



Since the velocity of the point- wise singularity is not a constant now, we cannot use 
the Lorentz transformation directly. We are able to use it, however, in the infinitesimal 
sense. Similarly to (27), we assume 



dR{t, r) = Jr — a'll) < -0, Jr > 



(44) 
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Here a and [3 are Lorentz’s functions of the velocity | IV^I P 

\ + + + ( 45 ) 

Vi-ii^iP 

and 

“ = 77^77 -1 (l - ^11-011^ - ^11-011'^ + •••) • (46) 

llV^lr z \ z s / 

The function R(t,r) is presented now in the form of an integral expression of the 
unknown function In the case of the slow velocities of the particles, this techni- 
cal difficult can be easily resolved. So we restrict our calculations to the condition 
I I ^ 1 and remove all terms of the order 0(| l-^l p )and higher. In Eq. (44), we 
can delete now the middle term and assume P = 1. The remaining equation is easily 
integrated and we come to our old non-relativistic ansatz 



Pit, r) 



m 

||r-r^- V^(OlP 



(47) 



Recall that in order to use this ansatz, we must restrict to the lowest approximation. 
We are in a position now to generate an agent of inertia — the mass-times-acceleration 
term. Using the expression (13) and removing the terms of the order 0(\ IV^I P), we 
remain with 



□ Pit, r) 



m 

l|/?a,r)|p 



<ip, Rit, r) > . 



This is a form of our agent of inertia. 



(48) 



3.2 Agent of Interaction 

We are looking now how the interaction between two particles (two singularities of 
the field) can be generated by the field equation. It is clear that, for a linear equation, 
two distinct singular solutions are combined into a solution with two singular points 
without any interaction between them. Thus we need a non-linear operator acting on 
the field. Consider a Lorentz invariant term of the form 
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For a field of the form, <p = this expression gives 

+ l|V^^Vll^ + 2 • (50) 

Consider a case when 0) ^ and <p are two solutions of the field equation that contains 

the term (49). Now, the first four terms of (50) are canceled and we remain with 

2 . (51) 

This expression includes symmetric contributions of two singularities, so it can be 
interpreted as the interaction term. 

We consider first what is going in the case of 1 -point singularity. Let us calculate 
the expression (49) for the ansatz (47). Since the time derivative term of the field (47) 
is of the order 0(| IV^I p), it can be neglected. As about the spatial derivative term, it 
takes the form 



l|V0||2 



l|/?(?,r)||4- 



(52) 



This term remains non-zero even in the case of the inertial motion. Thus we must 
fund out a way how to correlate two necessary terms: The inertia term (48) and the 
interaction term (49). 



3.3 Non-linear Scalar Equation 



We consider a non-linear Lorentz invariant scalar equation for a scalar field cp = (p{x) 
given on a flat Minkowski space 



T] 



a[3 



d'^if 

dx^dx^ 



dx'^dxp) 



It is equivalent to 



(53) 



(54) 



Here ^ is a dimensionless constant. We will discuss its value in the sequel. Let us 
derive now a static solutions of Eq. (53) with arbitrary values of the parameter k. 
First, we redefine the scalar field as 




(j) = 



(55) 



It follows that 
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(56) 



Thus, under the transformation (55), the nonlinear equation (53) is transformed to the 
linear one □0 = 0. Consequently, every solution of the linear equation 0 generates 
a solution ^ of the nonlinear equation (53). 

Starting with a function of the form 



we come to a 1 -singular solution of Eq. (53) 



For an arbitrary value of the parameter k, this solution vanishes at infinity as ^ 
m/r. Also, in the limiting case ^ ^ 0, the solution (58) approaches the Newtonian 
potential. These two limits hold independently of the sign of k. We observe, however, 
that for ^ > 0 the solution (58) is defined only for ||r — roll > km. this case, 
we have some type of a scalar black hole. For k < 0, (58) is singular only at a 
point r = ro and smooth at all other points. We will see in the sequel that namely 
the negative values of the parameter k yield the attraction between the singularities, 
while the positive values of k yield their repulsion. 

We consider now a non-static ansatz of the form (58) 



As we have already seen, it is enough to restrict, in the lowest approximation, to the 
denominator 



Substituting this expression into Eq. (53) and removing the terms of the order 
0(1 1-01 p) we obtain 




(58) 




(59) 



R(t,r) = r-ro-0(O. 



(60) 




( 61 ) 



This is our new agent of inertia. 
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3.4 An Interaction Is Generated 



Consider first a static solution of Eq. (53) with two singular points. In the lowest 
approximation, we may use the superposition of two individual static solutions, 

1 / mi \ 1 / mo \ 

m = — In ( 1 -k ^ I - - In 1 1 -k ^ — ). (62) 

^ k \ ||r-n||; k V I|r-r 2 ||/ 

Here r is a generic point of observation, while ri , ro are the positions of two singular 
points with the masses m\ and m 2 respectively. 

In order to generate a non- static (interaction) solution of Eq. (53) with two singular 
points moving on their own trajectories, we modify the definition of the denominators. 
Let '0i(O and '02(0 describe the trajectories of the singular points. Thus we come 
to an ansatz of the form 







(63) 



where 



Ri = ||/?,-|| = ||(r-r0 -0KOII i = 1,2. (64) 

We search for conditions on the functions 0/(0 that turn (63) into an approximate 
solution of (53). Substituting (63) into Eq. (53) and removing the terms of the order 
O(ll'0/lP)> weget 



< Ri, -01 



mi 



l—k(mi/Ri) 



< R 2 , '02 > 
rI 1 - kim^IRi) 



nn 

I 



—km\m2 <R\, R2> 



RjRl (1 - k(mi/Ri)) (1 - k(m 2 lR 2 )) 



(65) 



Both sides of Eq. (65) are functions of a point r. They are singular at two points 
where R\ = 0 and R 2 = 0, i.e., at the curves 



r = ri— 0i(0, and r = r 2 — 02 ( 0 , (66) 



and regular at all other points (Eig. 2). 

Let us examine how the relation (65) can be satisfied in a neighborhood of a 
singular point. We consider near zones around two masses. They include points that 
are close enough to the singularities but yet far from the corresponding Schwarzschild 
spheres R = km. In these domains, we can essentially simplify Eq. (65) 



mi •• m 2 •: 



—km\m2 <R\, R2> 



RjR 



3 

2 



( 67 ) 
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Fig. 2 On the picture, two masses m\ and m 2 move along their trajectories and '02(0 

respectively. Small circles around the masses represent Schwarzschild spheres R\ = km\ and 
R 2 =km 2 . Big circles show the near zones of the particles 



We will examine the field at an arbitrary point r close to the mass m \ . Consequently 
we have R\ ^ 0. For the second singularity 

/?i ^ 0 yields R 2 Rn, (68) 



where R 12 is sl vector directed from the point m 2 to the point mi. 

On the left hand side of the equation (67), there is a singular term. It must be 
compensated by the corresponding singular term on the right hand side. These terms 
are of the same order near the first singularity. So they cancel each other in 

the case when the corresponding leading coefficients are equal. Consequently, 



mi< Ri,ipi > = 



—km\m2 <R\,R2> 



R 



3 

2 



(69) 



Take into account that the point r (in 1 ) is arbitrary. Hence (69) is valid only if 



miipi 



-km\m2R2 

M ■ 



For small values of r, we have in (70) 



m\ipi = 



—km\m2R\2 



R 



12 



(70) 



(71) 



For k > 0, Eq. (71) results in repulsion between the particles. The negative values 
of the parameter k yield attraction. The absolute value of k is unimportant, since it 
amounts to the rescaling of the mass. This way the Newton-type law of attraction is 
derived from the scalar field equation. 
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4 Discussion 

We present here an illustrative example of a non-linear scalar model that allows the 
derivation of the equation of motion. In fact, this model has some plausible features 
as Newton’s behavior at infinity and Schwarzschild type form near the origin. So it 
can be used as a laboratory for the real gravity, compare [23, 24] . 

Is it possible to extend our consideration to the electromagnetism? In fact, there is 
an essential difference in physics of electromagnetism and gravity. Observe first that 
we can hold two electric charges in arbitrary positions and study, with Maxwell’s 
equations, the electric field between them. Quite similarly we can hold two masses 
and study, with Einstein’s equations, the gravity field between them. In both cases, 
we are dealing with open systems. 

For a closed gravitational system containing masses and gravity field, two mas- 
sive particles cannot be in a rest or in a free motion. In Newton’s gravity, such 
behavior is allowed by the field equation Dp = 0 but it is forbidden by the second 
law of dynamics. In GR, such solution is forbidden already by the field equations 
themselves. 

In an electromagnetic system containing only electric charges and electromag- 
netic field. Maxwell’s system itself allows arbitrary motion of charges. In this sense. 
Maxwell’s system itself is incomplete. In order to forbid these arbitrary motions, one 
must arranged the system with the second Newton’s law, in particular with the notion 
of mass. Indeed the charges necessary have the non-zero masses. But the notion of 
mass is not related to the electromagnetism, it is a gravity phenomena. It means that 
electromagnetism cannot exist without gravity, even the gravity effects are very small 
in ordinary situations. In this sense electromagnetism itself must be considered as an 
open system. One can guess that in the Einstein-Maxwell system (or in some type 
of a unified theory), the equations of motion of massive charges must be derivable 
from the field equations as in the pure Einstein’s system. 
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Motion of Spinless Particles 
in Gravitational Fields 



Alexander J. Silenko 



Abstract The covariant Klein-Gordon equation describing a scalar particle in a 
Riemannian spacetime of general relativity is transformed to a Hamiltonian form by 
the generalized Feshbach-Villars method applicable for both massive and massless 
particles. The subsequent Foldy-Wouthuysen (FW) transformation allows to derive 
the relativistic FW Hamiltonian for a wide class of inertial and gravitational fields 
and find the new manifestation of conformal invariance for a massless scalar particle. 
Similarity of manifestations of conformal invariance for massless scalar and Dirac 
particles is proved. New exact FW Hamiltonians are derived for both massive and 
massless scalar particles in a general static spacetime and in a frame rotating in 
the Kerr field approximated by a spatially isotropic metric. The latter case covers 
an observer on the ground of the Earth or on a satellite and takes into account the 
Lense-Thirring (LT) effect. High-precision formulas are obtained for an arbitrary 
spacetime metric. General quantum-mechanical equations of motion are derived. 
Their classical limit coincides with corresponding classical equations. The quantum- 
mechanical description of the relativistic LT effect is presented. The exact evolution 
of the angular momentum operator in the Kerr field approximated by a spatially 
isotropic metric is found. The quantum-mechanical description of the full LT effect 
based on the Laplace-Runge-Lenz vector is given in the nonrelativistic and weak- 
field approximation. Relativistic quantum-mechanical equations for the velocity and 
acceleration operators are obtained. The equation for the acceleration defines the 
Coriolis-like and centrifugal-like accelerations and presents the quantum-mechanical 
description of the frame-dragging effect. 
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1 Introduction 

The problem of quantum-mechanical description of a spinless particle in a Rie- 
mannian spacetime of general relativity (GR) has a long history. An appropriate 
form of the initial covariant Klein-Gordon (KG) equation [1-4] has been discovered 
fifty years ago by Penrose [5]. He has found an appropriate term describing a non- 
minimal coupling to the scalar curvature and conserving the conformal invariance 
of the equation for a massless scalar particle. Chemikov and Tagirov [6] have given 
clear explanations of this wonderful result. Their study involved the case of a nonzero 
mass and ^-dimensional Riemannian spacetime. It is important that the inclusion of 
the Penrose-Chernikov-Tagirov term has been argued for both massive and massless 
particles [6] . 

Next step in investigation of the problem of conformal invariance of the KG equa- 
tion has been made by Accioly and Bias [7, 8]. They have performed the exact FW 
transformation for a massive spin-0 particle in static spacetimes and have found 
new telling arguments in favour of the predicted coupling to the scalar curvature. 
A derivation of the relativistic FW Hamiltonian fulfilled in Refs. [7, 8] is important 
for a comparison of gravitational (and inertial) effects for scalar and Dirac particles. 
However, the transformation method used in Refs. [7, 8] is inapplicable to massless 
particles. In addition, it cannot be employed for nonstatic spacetimes. This does 
not allow to obtain an information about a specific manifestation of the conformal 
invariance in the FW representation. 

A procedure of derivation of relativistic quantum-mechanical equations of motion 
from the FW Hamiltonians is fairly well known. Such a procedure was widely used for 
a Dirac particle in noninertial and gravitational fields (see Refs. [9-13] and references 
therein). It has been proved in Ref. [14] that satisfying the condition of the Wentzel- 
Kramers-Brillouin approximation allows to use this approximation in the relativistic 
case and to obtain a classical limit of the relativistic quantum-mechanical equations 
of motion. 

In the present work, we consider a scalar particle in arbitrary Riemannian space- 
times. To obtain a Hamiltonian form of the initial covariant KG equation not only 
for massive particles but also for massless ones, we use the generalization of the 
Feshbach-Villars transformation [15] proposed in Ref. [16]. Then we fulfill the FW 
transformation and prove the conformal invariance of the relativistic FW Hamiltonian 
for a wide class of inertial and gravitational fields. We derive general quantum- 
mechanical equations of motion and obtain their classical limit. As an example, 
quantum-mechanical description of the LT effect is presented and its classical limit 
is found. 

We denote world and spatial indexes by Greek and Latin letters a, //, z/, . . . = 
0, 1,2,3, i, j,k, . . . = 1,2, 3, respectively. Tetrad indexes are denoted by Latin 
letters from the beginning of the alphabet, a,b,c, . . . = 0, 1,2, 3. Temporal and 

spatial tetrad indexes are distinguished by hats. The signature is (H ), the Ricci 

scalar curvature is defined by R = where — . . . 

is the Riemann curvature tensor. We use the system of units h= c = \ except for 

some specific expressions. 
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2 Importance of the Penrose-Chernikov-Tagirov Term for 
Massive Scalar Particles 



The covariant KG [1-4] equation with the additional term [5, 6] describes a scalar 
particle in a Riemannian spacetime and is given by 



(□ + 



XR)^ = 0, 



□ = 









( 1 ) 



Minimal (zero) coupling corresponds to A = 0, while the Penrose-Chernikov-Tagirov 
coupling is defined by A = 1 /6. Sign of the Penrose-Chernikov-Tagirov term depends 
on the definition of R . For noninertial (accelerated and rotating) frames, the spacetime 
is fiat and R = 0. 

For massless particles, the conformal transformation 

9iiv — O Qijii/ ( 2 ) 



conserves the form of Eq. (1) but changes the operators and the wave function [5, 6]: 




t/; = 0, ^ = O'ljj. 



In Ref. [6], higher dimensionality was also considered. 
The corresponding classical equation 

9^‘'PfiPiy = 0 



(3) 



is also conformal for a massless particle. It does not contain any nonminimal coupling 
to the scalar curvature. Therefore, the square of the classical momentum corresponds 
to the operator — R/6) [6]. 

Chernikov and Tagirov [6] have shown the importance of the additional term for 
massive particles. They have proved that the requirement for motion to be quasiclas- 
sical for a large momentum is satisfied for massive and massless particles only when 
A = 1/6. This choice of A has been additionally substantiated in Refs. [17-19]. 

Important development of problem of the Penrose-Chernikov-Tagirov coupling 
in the GR for massive particles has been made by Accioly and Bias [7, 8]. They 
analyzed a dependence of the form of the FW Hamiltonian on the value of A and 
considered the diagonal static metric in isotropic coordinates: 

ds^ = V{rf{dx^f - W{rf{drf (4) 



with arbitrary V(r), W(r). The choice of the metric allowed an exact FW transfor- 
mation by the method used in Ref. [7, 8]. This method included the Feshbach-Villars 
transformation (unappropriate for massless particles) in order to bring initial Eq. (1) 
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to the Hamiltonian form. Next nonunitary and FW transformations resulted in the 
FW Hamiltonian [7, 8]: 



nFw=P3yjm'^V^+Tp'^T - 1 VjP- VJ^+D aCV, W), (5) 

where /? = — /V is the momentum operator, pi (i = 1, 2, 3) are the Pauli matrices, 
= V I W, and the Darwin term is given by 

mV.W) = ^AV + ^AW-2x[:^AV 
+ + 2^AW - £(VIV)2] . 

Only for A = 1/6, this term has the simple form and is equal to \T /^T [7, 8]. 

However, the important result obtained by Accioly and Bias demonstrates only a 
shadow of the conformal invariance, because it does not cover the case of m = 0. 
We perform general examination of the problem. 



3 Generalized Feshbach-Villars Transformation 

General form of the covariant KG equation is given by 

(^0 + ^ 007 ^ {di , dj + i> = 0. (7) 

The curly bracket denotes the anticommutator. 

There is an ambiguity [20] in the definition of parameter of the Feshbach-Villars 
transformation. We use the generalized Feshbach-Villars (GFV) transformation pro- 
posed in Ref. [16] and based on this ambiguity. In the considered case, the GFV 

transformation consists in an introduction of the new wave function ^ = 

which components are given by wave function: 

V’ = 0 + X. i(do + T)ip = N{<p-x), 

Here N is an arbitrary nonzero real parameter. For the Feshbach-Villars transforma- 
tion, it is definite and equal to the particle mass m. The two components form the 

which normalization is defined by f {(jf(j) — x'^x)dV= 1. 

The use of Eq. (8) results in 



wave function 







- X*X = {do + T) 
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Since the original Feshbach-Villars transformation corresponds to = m, it 
becomes (unlike the GFV one) unapplicable for a massless particle. 

The GFV transformation allows us to represent Eq. (7) in the Hamiltonian form 
describing both massive and massless particles [21]: 



T = g(yb dj + ™ - T^. 



(9) 



Similarly to Refs. [7,8], then we perform the nonunitary transformation 
to obtain a pseudo-Hermitian (more exactly, ps-pseudo-Hermitian) Hamiltonian: 
Ji' = H' = p^H'^ p 3 . In the case under consideration, 

7 ( 10 ) 

r = - (T2)'. 

Transformed operators are denoted by primes and (T^)' = Tedious but simple 
calculations result in [21] 



07 / N^ + T^ , ; -N^ + T^ ;^/ 

/T — P3 2N ^ P2 2N ’ 



r = di%dj + 



2N 
m^—XR 



+ Vj (}) 



+y^G9v.v,.(i) + ^[v,(r)f 

-2^1 V,- (^) V; (r/) - ^ W; (r/) , 



( 11 ) 



where the nabla operators act only on the operators in brackets and the primes denote 
transformed operators. Equation (1 1) is exact and covers any inertial and gravitational 
fields. 



4 Foldy-Wouthyusen Transformation 

General methods of the FW transformation for relativistic particles have been devel- 
oped in Refs. [16, 22, 23]. They belong to step-by-step methods performing the 
transformation as a result of subsequent iterations. We use the version [16] adapted 
to scalar particles. 
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The GFV Hamiltonian can be decomposed into operators commuting {M and £) 
and anticommuting (O) with p 3 : 



H' = P 3 M + f + O, P 3 M = Mp3, 



P3£ = P 3 O = -Op3, 



M = 



N^+T' c 
2N ’ ^ 



-iT 



f 



O = ip2 



-N^+T' 

2N 



( 12 ) 



The relativistic FW transformation is carried out with the p 3 -pseudounitary oper- 
ator(C/t = p3t/-V3)[16] 



U = 1±^±££^ e = ^M^ + 0^. 
V2e(e + M) 

In the considered case, 

e + jV + pi(e-jV) 

2^/eN 



(13) 



(14) 



It is important that the Hamiltonian obtained as a result of the transformation does 
not depend on [16]: 



'£i\— p3e-\-£\-\-Oi, Vp3^ ^i\ — {p2^ 0\] — 0, 

= (15) 

Q = -idQ-ir. 

This shows a self-consistency of the used transformation method. 

Next transformation [16] eliminates the residual odd term and results in 



Hfw = + 



4 



1 

e 






(16) 



While relativistic step-by-step methods perfectly describe the contact (Darwin) inter- 
action and strong field effects (see Refs. [9-1 3]), all of them are approximate [24, 25] . 
Allowance for the last term in Eq. (16) exceeds the achievable accuracy. Its omitting 
leads to the final form of the approximate relativistic FW Hamiltonian [21]: 

WFiv=p3e-lT'-^[Ve, [Ve, (/5o+«T')]];^. 



(17) 
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5 Exact Foldy-Wouthuysen Transformation 
and Conformal Invariance 

The used method ensures the exact FW transformation for a wide class of spacetime 
metrics. The manifestation of conformal invariance can also be investigated in detail. 

The sufficient condition of the exact FW transformation [16, 22, 23] applied to 
scalar particles is given by [T' , Q] = 0 or, equivalently, OqT' — [T', T^] = 0. When 
it is satisfied, the exact FW Hamiltonian reads 



Equation (18) covers all static spacetimes {T' = 0) and some important cases of 
stationary ones. 

Since general expressions for the scalar Ricci curvature are very cumbersome, we 
restrict ourselves to an analysis of several special cases. For the metric defined by 
Eq. (4), the result of our calculations formally coincides with Eq. (5). However, the 
case of m = 0 can now be considered. Explicit expression for V\(V, W) [7, 8] shows 
the presence of conformal invariance for massless particles if and only if A = 1/6. 
In this case, conformal transformation (2) does not change the FW Hamiltonian and 
the FW wave function These manifestations of conformal invariance radically 
differ from those for the covariant KG equation and the corresponding wave function. 

The validity of the found properties can be checked for the scalar particle in 
nonstatic spacetimes. The metric of the rotating Kerr source has been reduced to the 
Arnowitt-Deser-Misner form [26] by Hergt and Schafer [27]. This form reproduces 
the Kerr solution only approximately. The form of the metric can be additionally 
simplified due to an introduction of spatially isotropic coordinates and dropping 
terms violating the isotropy [13]: 



The use of the approximate Kerr metric allows to fulfill the exact FW transforma- 
tion when V, W, and lv depend only on the isotropic radial coordinate r. In this 
approximation, the metric is defined by 



Tipw — ~ 



(18) 



ds^ = — W^Sij(dx^ — dx^){dx^ 

-KJdx^), K =iv xr. 



(19) 




( 20 ) 



Here a = J ! {Me) , /i = GM/c^ ; the total mass M and the total angular momentum 
J (directed along the z axis) define the Kerr source uniquely. The leading term in 
the expression for uj{r) = u{r)ez corresponds to the LT approximation. 
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We can pass on from the Kerr field approximated by Eqs. (19) and (20) to a frame 
rotating in this field with the angular velocity o after the transformation dx^ 
dX^ = dx^ -\-(o X r)dx^. The stationary metric of this frame can be obtained from 
Eqs. (19) and (20) with the replacement u) = — In particular, it covers an 

observer on the ground of a rotating source like the Earth or on a satellite. In this 
case, o = ///, where I is the moment of inertia. It should be taken into account that 
frames rotating in the isotropic and Cartesian coordinates are not equivalent. 

In the isotropic spherical coordinates, the metric takes the form 

ds^ = [V^{r) - W^{r)Q?{r)r^ e]{dx^)^ 

-W^{r)[2Q.{r)r^ sin^ 0dx^ dip + dr^ + r^{d0^ + sin^ 0dp^)]. 

The exact EW Hamiltonian is given by Eq. (18) where [21] 



and derivatives with respect to r are denoted by indexes. In particular, for the LT 
metric 



While metric (19) and (20) reproduces the Kerr solution only approximately, 
the derivation of the exact EW Hamiltonian corresponding to this metric allows an 
independent unambiguous determination of the value of A. If and only if A = 1/6, 
conformal transformation (2) changes neither T' nor Hfw, This property is 

the same as for the static metric. 



6 General Equations of Motion 

Equations for the EW Hamiltonian allow us to derive general quantum-mechanical 
equations of motion and then obtain their classical limit (h 0). The quantum- 
mechanical equations of motion defining the force, velocity, and acceleration read 
(po = 'Hfw) 



r = i + Vx(V, W) 

+ f(x2 + y2)(J2;>2, Vx(V,W) = 



( 22 ) 




—iT' = • (r X p), 




(23) 




Any commutation adds the factor h as compared with the product of operators. 
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It has been proved in Ref. [14] that satisfying the condition of the Wentzel- 
Kramers-Brillouin approximation allows to use this approximation in the relativistic 
case and to obtain a classical limit of the relativistic quantum mechanics. Deter- 
mination of the classical limit reduces to the replacement of operators in the FW 
Hamiltonian and quantum-mechanical equations of motion in the FW representation 
by the respective classical quantities. The classical limit of the general FW Hamil- 
tonian is given by [21] 



H 



_ - G'i Pi 

“V 5 “ ) 



9^' Pi 

„oo ■ 



(25) 



It coincides with the classical Hamiltonian derived in Ref. [28]. The classical limit 
of Eq. (24) reads 



y/ _ Pj ^ 

^gOO(jn^-Gij Pi pj) ’ ^ 26 ) 

= Pi^Hr + 5 °'^ + 9'^djH + Pi^VJdjg^^^. 

It coincides with the corresponding classical equations which follow from Hamil- 
tonian (25) and the Hamilton equations. Thus, the quantum-mechanical and classical 
equations are in the best compliance. 

For example, the exact metric of a general noninertial frame characterized by the 
acceleration a and the rotation o of an observer is defined byV = 1 a • r,W = 1, 
^ = —o[29]. In this case, the classical limit of the Hamiltonian and equations of 
motion is given by [21] 

H = (I a ’ r)y/m^ -h — o • (r x 

V = (1 +fl • r) , ^ -oxr, 

W = —a(l + fl-r) — 2oxV — ox 

+ 2«.V+«.(oxr) (V + oxr), 

where p = (—pi, —p2, ~P3)- Leading terms in Eq. (27) reproduce well-known 
classical results [30, 31]. 



P), 



(o X r) 



(27) 



7 Quantum-Mechanical Description 
of the Lense-Thirring Effect 

The results obtained allow to derive quantum-mechanical equations describing the 
LT effect. When a metric depends only on r , it is convenient to consider the evolution 
of the angular momentum operator I = r x p. Dynamics of this operator in a frame 
rotating in the Kerr field approximated by a spatially isotropic metric is defined by 
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dl i 

— = — [Tifw^ /] = X /, Qi = (jij — o. (28) 

dt h 

Since the operators and / commute, this equation is exact /6>r the chosen metric. 

The quantity u characterizes an evolution of the longitude of the ascending node, 
T: T = To + cjL Equations (20) and (28) provide for a relativistic post-Newtonian 
description of this evolution: 



_ 2GJ r 3GM 21G^M^ 



(29) 



This is a part of the LT effect. The longitude of the ascending node can be measured 
and its measurement is important for astrophysics. 

A transition to the classical limit [14] and a calculation of the period average in 
the nonrelativistic and weak-field approximation results in 



2GJ 






UJ = 






(30) 



where b is the semimajor axis and e is the eccentricity. 

The quantum-mechanical description of the full LT effect is based on the Laplace- 
Runge-Lenz (LRL) vector. In this case, we confine ourselves by the nonrelativistic 
and weak-field approximation. The operator form of the LRL vector is given by 



1 ^ ^ r 

A = -(pxl— lx p) — mkr, r = -, k = GMm. 

2 r 



(31) 



The nonrelativistic LW Hamiltonian for the Kerr field in the LT approximation reads 

.2 > 



Tipw = P3 h • /, 

\ r 2m / 



(32) 



where is defined by Eq. (23). The precession of pericenter of the orbit is defined 
by the commutator of the operators Hfw and A: 



dA i_ 

dt h 



[Hfw, ^] = 5 (Si X a - a X ft) + P (r X / - Z X r)| . (33) 



The transition to the classical limit and the calculation of the period average brings 
the LT equation: 



^ = ftir X A, Qlt = ^[J- 3(7 -T)T] , (34) 



where / — Ijl. 
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The existence of the frame dragging can also be shown. If we hold only main 
terms in the relativistic FW Hamiltonian presented by Eqs. (18) and (22), the velocity 
operator in the field defined by the LT metric (23) is given by 



V = f 



a / rri^c^ 



.TpT 



+ X r. 



(35) 



In the weak-field approximation, the part of the acceleration operator defined only 
by the rotation of the source is equal to 



W = ^^xV — Vx^2 — S2x(^2xr). 



(36) 



This equation defines the Coriolis-like and centrifugal-like accelerations and 
therefore describes the quantum-mechanical frame-dragging effect. 

It is important that the classical limit of quantum-mechanical equations coincides 
with the corresponding classical equations. This unambiguously shows a deep con- 
nection between the relativistic quantum mechanics of scalar particles in Riemannian 
spacetimes and the classical GR. 



8 Conformal Invariance for Dirac and Classical Particles 

It is possible to compare the conformal transformations in the GR for massless 
scalar, Dirac, and classical particles. Our analysis shows that the general Hermitian 
Dirac Hamiltonian for a massless particle in an arbitrary metric in the presence 
of an electromagnetic field [13] is not changed by the transformation (2). The FW 
transformation operator for particles in strong external fields obtained in Ref. [23] 
is also conformally invariant in the case of m = 0. As a result, the Dirac and FW 
wave functions, and and the FW Hamiltonian remain unchanged. These 

properties of Dirac particles are the same as for scalar ones. The Hamiltonian of 
massless classical particles is conformally invariant even if its spin-dependent part 
defined by Eqs. (3.18) and (4.12) in Ref. [13] is taken into account. 

In the general case, the transformation of the initial covariant Dirac equa- 
tion to the Hermitian Hamiltonian form is performed by the nonunitary operator 
/d = [13]. Since the transformation (2) leads to /d = the con- 

formally transformed wave function of the initial covariant Dirac equation, 4^, reads 

^ = /d V = C»3/2xi/. (37) 

While its transformation is similar to that for the scalar particles, the powers of O in 
Eqs. (3) and (37) differ. 
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The second-order wave equation for the Dirac particles in general electromagnetic 
and gravitational fields derived in Ref. [13] includes the term describing a nonminimal 
coupling to the scalar curvature R. As the definitions of R in Ref. [13] and the present 
work differ in sign, this term corresponds to A = 1 /4. 



9 Conclusions 

The use of the GFV and relativistic FW transformations allows to describe the both 
massive and massless scalar particles in general noninertial frames and gravitational 
fields. The present work demonstrates the new manifestation of the conformal invari- 
ance for massless particles. The conformal transformation conserves the FW Hamil- 
tonian and the FW wave function while it changes the wave function of the initial 
KG equation. The similar conclusion is valid for the Dirac particles. The nonmini- 
mal coupling to the scalar curvature is a common property of the scalar and Dirac 
particles. 

The results obtained in Refs. [7, 8, 21] allow to state the general property of 
conformal symmetry for massive particles. Conformal transformation (2) changes 
only such terms in the FW Hamiltonian which are proportional to the particle mass m . 
This property is valid not only for real scalars (Higgs boson) but also for compound 
ones (zero-spin atoms and nuclei). Dirac particles possess the same property. 

Contemporary methods of (pseudo)unitary and nonunitary transformations make 
it possible to derive new exact FW Hamiltonians for both massive and massless scalar 
particles i) in the general static spacetime and ii) in the frame rotating in the Kerr field 
approximated by a spatially isotropic metric. The latter result covers an observer on 
the ground of the Earth or on a satellite. It reproduces not only the well-known effects 
of the rotating frame but also the LT effect. For an arbitrary metric, high-precision 
formula (17) is obtained. 

The classical limit of the derived general quantum-mechanical equations of motion 
coincides with corresponding classical equations. This important general conclusion 
is confirmed by the quantum-mechanical description of the relativistic LT effect. The 
exact evolution of the angular momentum operator in the Kerr field approximated 
by a spatially isotropic metric is found. The quantum-mechanical description of the 
full LT effect based on the Laplace-Runge-Lenz vector is given in the nonrelativistic 
and weak-field approximation. Relativistic quantum-mechanical equations for the 
velocity and acceleration operators are obtained. The equation for the acceleration 
defines the Coriolis-like and centrifugal-like accelerations and presents the quantum- 
mechanical description of the frame-dragging effect. 
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On the Strong Field Point Particle Limit 
and Equation of Motion in General Relativity 



Toshifumi Futamase 



Abstract Strong field point particle limit is developed in order to take into account 
strong internal gravity in the post-Newtonian approximation. The concept is also 
applicable to the motion of a fast moving particle. We give an introduction of the 
concept and show the application to the equation of motion. 



1 Introduction 

Recent interest on the equation of motion in general relativity is motivated by the 
development of gravitational telescope. It is expected that gravitational waves will 
be directly observed in very near future and will open up a new branch of astronomy, 
gravitational wave astronomy. However it is obvious that the detailed theoretical 
understanding for the source and the mechanism of wave generation are indispens- 
able in order to interpret the observed data. This is particularly important for gravi- 
tational waves because the signals are extremely small. Without theoretical template 
no meaningful information can be obtained. This is the reason of renewed interest 
on the relativistic equation of motion. Although the equation of motion has a long 
history starting right after the development of general relativity, it has not been fully 
understood. 

For systems whose velocity of typical motion are much less than the speed of 
light, one can use of the ration between the speed of light and the typical velocity as 
a smallness parameter e to develop the so-called slow motion approximation. If the 
system in concern is gravitationally bounded and the typical motion is governed by its 
gravitational potential, the slow-motion expansion is at the same time the expansion 
with respect to the gravitational potential and thus is called post-Newtonian approxi- 
mation [1-4] . The post-Newtonian approximation is applied for various astrophysical 
systems including planetary motion in solar system and relativistic compact binaries 
with remarkable success [5-8] . The predicted orbital motion for the compact binaries 
is now used as a tool to determine physical parameters such as masses of component 
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stars. However the usual treatment of the post-Newtonian approximation assumes 
that the field is everywhere weak including inside the component stars which is not 
applicable for relativistic binaries. Another difficulty of the post-Newtonian approx- 
imation is that the convergence of the expansion series becomes very slow in higher 
orders. At present 3.5 post-Newtonian approximation has been calculated [9-11], 
but there is a discussion that 4.5 post-Newtonian calculation will be necessary to 
make sufficiently accurate theoretical templates for predicting accurate waveform 
from coalescing binaries. Strong internal gravity in compact binaries becomes prob- 
lematic in the calculation of higher-order post-Newtonian approximation because of 
high nonlinearity. The use of singular function such as Dirac delta function in higher 
order calculation is not satisfactory not only mathematically but also physically. 
Mathematically one cannot define singular function in nonlinear theory. Physically 
there is not singular source in general relativity. When one makes a singular source, 
one ends up a black hole which is regular horizon at least from outside. Although 
there are some elegant methods to deal with singular source in higher-order post- 
Newtonian calculation [12-14], it would be better to formulate the post-Newtonian 
approximation in such a way that it automatically take the strong internal gravity 
into account without singular sources. This is done by introducing the strong field 
point particle limit in the post-Newtonian approximation [15, 16]. 

For systems whose motion is highly relativistic, there are some development 
recent years and the equation of motion is known [17-20]. However it is still u 
satisfactory in the sense that practical treatment is possible only for limited type of 
orbital motions [21-24] (however there have been some progresses in this direction 
recently [25-27]). Although higher order equation of motion in the relativistic case 
is not known yet, the use of singular source would cause serious theoretical difficulty. 
In this sense it is preferable to formulate the equation of motion without singular 
source. The strong field point particle limit may have a possibility to have such a 
formulation. 

This article is the introduction to the concept of strong field point particle limit 
and its application to the equation of motion in general relativity. In Sect. 2, we 
give a formulation of the post-Newtonian approximation on the space of solutions of 
Einstein equation whose initial data satisfy the Newtonian scaling. Then we introduce 
the point particle limit introduce and derive the equation of motion in the surface 
integral form in Sect. 3. In Sect. 4, we apply the strong field point particle limit to 
derive the equation of motion for a fast moving particle. Although the derivation is 
still in the stage of formal, we expect to be useful for future practical application. 
Finally some discussions are given. 



2 Newtonian Scaling and Strong Field Point Particle Limit 

As discussed above, the strong field point particle limit is introduced in order to take 
the strong internal gravity of the relativistic binary system in the post-Newtonian 
approximation. This is done by realizing that the Newtonian scaling with respect 
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to the smallness parameter has a room to allow the strong internal gravity. Thus 
we first formulate the Newtonian approximation based on the Newtonian scaling. 
The post-Newtonian approximation is actually the asymptotic approximation to the 
sequence of solutions of Einstein equation which is defined by Newtonian initial data 
[28]. In the case of binary system, the Newtonian initial data is characterized by the 
following scaling relation. 



^ e, m ^ (1) 

where is the orbital speed, and m is the mass of component stars. This relation 
guarantees the Newtonian orbit at least for some short time. Relativistic effects will 
deviate the scaling. The post-Newtonian approximation becomes accurate as the 
parameter goes to zero. This means that the limit e ^ 0 is should be understood, but 
the point is that the limit taken is along the trajectory on the sequence which connects 
physically equivalent events at each solution. Namely the time interval is also scaled 
as thus appropriate time coordinate on the sequence is not the coordinate time 
but the Newtonian time r = eL In fact as e goes zero, the Newtonian time stay 
constant and the limit is taken along the trajectory on which r = constant. 

It should be noticed that it is sufficient for mass not the density to scale as in 
order to guarantee the Newtonian behavior for the orbital motion. This is important 
because it allows us to have strong internal gravity for the component stars. Namely 
one can fix gravitational potential in the course of limit by assuming the size of the 
component stars shrink as e^. This means the density scales as This scaling 
suggests the introduction of the body coordinates whose relation with the near 
zone coordinates (v^) is as follows: 

x‘(t) = (t) + (t) (2) 

where we consider a system of compact binary and z\ is the center of mass coordinate 
of star A (A = 1,2). We have used the Newtonian dynamical time r because we 
neglect dynamical change of the internal structure. This is guaranteed by taking 
equilibrium internal configuration for the component stars as their initial data (the 
tidally induced change will appear at 4th post-Newtonian order). We further define 
the body zone Ba whose boundary is defined as 

\x-za\ = ^Ra (3) 

where Ra is some fixed number. This boundary shrinks as e in the limit e ^ 0 
in the near zone coordinate, but expands as e~^ in the body zone coordinate. Thus 
the boundary is far zone from the point of view of component star and thus the 
gravitational field by the component star may be obtained by far zone expansion 
(multipole expansion). 

This property is essential to calculate higher-order post-Newtonian approximation 
without any singular behavior and also deriving the formal equation of motion for a 
fast moving particle. 
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3 Bases of the Post-Newtonian Calculation 

In this section we explain how the strong field point particle limit is implemented 
in the actual calculation of post-Newtonian approximation. This is done by writing 
equation of motion in the integral form. 

We first define the 4-momentum of each component star as 



where is the Landau-Lifshitz pseudotensor [4], and the 

Landau-Lifshitz variable. Then we have 



This is not the equation of motion in the usual sense. We need the relation between 



These relations suffer from higher-order post-Newtonian corrections. The energy- 
mass relation is obtained from the time component of the above equation. The 
momentum- velocity relation is given by 




(4) 



where the integral is over the body zone Ba- is defined as 




(5) 




( 6 ) 



the energy and mass mA as well as the relation between the momentum and 
the velocity . In the Newtonian order these are 



Pl = rriA 
P\ = Ma 



(7) 

( 8 ) 




(9) 



where y‘ — z\ (r) and 




( 10 ) 



( 11 ) 
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is the dipole moment of the star A which we can choose arbitrary value. The 
above relation is obtained from the identity 

©"■ = ^ (12) 

This is the consequence of the conservation law = 0. 

Then we have the following equation of motion in the integral form. 



dy\ 

dr 




+ ^4 






+ 




dQ\ 

dr 



dr^ 



(13) 

(14) 



As mentioned above the energy mass relation is obtained from the time component 
of the Eq. (5). 

The first step of the calculation is to write down the formal solution of the Einstein 
equation. 



h^^(r,x) =4 d^y 



/ 



. Al^‘'(T-e\x-%y) 



l^-y| 



(15) 



This integral is divided into two parts, one from the body zone and the other from 
the other region except the body zone. 



/jb(t, x) 



x) 




3 A/'*"(r - e|?A - Ha + £^5a) 

da 1; 3-1; 

kA - e^CTAl 



. f ,3 A/'^(r-e|x-y|,y) 

4 / d y — 

Jn/b a — )’l 



(16) 

(17) 



where rA = y — za and N/B means the near zone region except the body zones. 
One can show by explicit calculation that the artificial boundary does not contribute 
to the final result [29]. 

In the surface integral approach we only need the gravitational field on the body 
zone boundary 8Ba . Since the body zone boundary is the far zone in the body zone 
coordinate the field hs is calculated by multipole expansion. On the other hand the 
hs/N is calculated by using the following formula. 




d^y 



f(y) 

\x - y\ 



—47rg(x) -\- f dSk 

Jd(B/N) 



1 



dgiy) 



_|x — y| dy^ 



^ d 



(|k-yl). 



(18) 



where g is the super potential of /, namely 



As(x) = /(x) 



(19) 
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For example, if / = l/(rir 2 ), then 

g = ln(ri + T 2 + rn) (20) 

Up to 2.5 post-Newtonian order all the necessary super-potentials have been known 
by the work of Jaranowski and Schafer [30, 31]. In higher-order calculation we 
have integrals in which super-potential cannot be found. Even in these cases, we can 
evaluate the field at the body zone boundary [11]. 

The surface integral expression for the equation of motion allows us to calculate 
the higher-order equation of motion step by step starting from only non- vanishing 
Newtonian field. 



hrr =4e^y^ ( 21 ) 

A 

with 



ma = lim Pi (22) 

e^O 

P\=mAv\ (23) 

Detailed calculation are presented in the references [16, 29, 32]. 



4 Application to the Fast Motion Equation of Motion 

We shall apply the point particle limit to the equation of motion for a fast moving 
small body in an arbitrary background field. Our starting point is the integral form 
of equation of motion derived in the previous section. 

d P^ f f 

— ^ ~ lim (f) dSn^Qi^^' = - lim / dQnAi + ea ■ n)ff r las (24) 

dr e-^^JdBA "-^0 J 

Here we used © = (—g)(T -h tii) instead of A in the integrand because the essential 
contribution comes from this part, and is the unit normal vector to the surface and 

( 2 ^ = Du^ /dr the 4-acceleration. 

Let suppose we have a background field and write the deviation from the 
background as h in this section (do not confuse h in the previous section). The the 
deviation field contains the singular field at the position of the particle if it is really a 
point. For the irrotational extended source the field which fall off as corresponds 
to the singular field. We shall write down the monopole field as 5g, then the whole 
field at the body boundary may be written as follows. 



9 = 9b+h= gs+6g 



(25) 
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where gs is the sum of the background and the regular part of the self field. There 
is no ambiguity of separating the regular and singular field because the separation 
is done on the body zone boundary. Then the Landau-Lifshitz pseudotensor at the 
body zone boundary may be symbolically written as follows 

®\dB - IllIob - rfe +<55)r(5(i +5g) 

- rigs)d(gs) + r(gs}d{6g) + r(6g}d{6g) 

where T represents Christoffel symbol. Each term has different e dependences 
because Qs ^ 0(e^), 6g ^ and thus the second term r(gs)d(6g) cancels 
another dependence coming from surface element and gives nonzero contribu- 
tion. The third term vanishes by the angular integral. 

Let us make the above argument more explicit. The singular part of the self field 
may be written as follows. 



Sgi^^'ix) = lj y)5{a{x, y))T<^^ (26) 

where a is the world function and the field point is supposed to be on the body zone 
boundary, and is the coefficient of singular part in the retarded tensor Greens 
function [20, 33]. In the above expression we do not consider the spinning particle. 
Note that our field point v is on the body zone boundary which is in the far zone of 
the star. This allows us to make use of the multipole expansion. For this purpose we 
choose our coordinate as 



= 2 «(r) + Sy"^, z^{t) = r, 5y^ = 0 (27) 



where z(r) is the world line of the center of the star. The center is assumed always to 
be inside the body in the point particle limit, and thus there is no ambiguity regarding 
the choice of the center. Thus we have 

5g^''{x) = — z(Tz))z"(Tj:)j^(Tj:) (28) 

a(x,z(T^}) "P 

where is the retarded time at the center given by cr(x, z(r^)) = 0, and 



mz^{Tz)z^{Tz) = lim [ (29) 

e->0 J 

Since we are going to take the point particle limit, the world function is simply 
approximated as l/2gi^y{x — zY{x — zY and ~ we have 



Sg^^ 



4mu'^u^ 



(30) 
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where = z, r^=x^ — and the z and u are evaluated at the retarded time. 
Thus its derivative may be evaluated at the body zone boundary as follows 

+ ,31) 

Using the above expressions in the LL tensor, we have the following result in the 
point particle limit: 

d 1 

(32) 

Here we need the relation between 4-momentum and the mass. Higher-order post- 
Newtonian result indicates that the ADM mass is related to the 4-momentum as 

= y/-gsmu>^ (33) 

Finally we have the equation of motion for a fast moving particle. 

= 0 (34) 

This is the geodesic equation but not in the background geometry, but on the 
smooth part of the geometry which is the sum of the background and the smooth part 
of the self field. 



5 Conclusions 

We have reviewed the strong field point particle limit which allows one to take the 
strong internal gravity into account in the equation of motion without any artificial and 
unphysical singularities. Although the concept is developed within the framework 
of post-Newtonian approximation, it can be applicable to the equation of motion for 
a fast moving small body in an arbitrary background spacetime as long as the size 
of the body is much smaller than the typical scale of background geometry. It has 
been shown that the strong field point particle limit can be applicable to the equation 
of motion up to 3.5 post-Newtonian order without any mathematical difficulty. It is 
interesting to see if it is applicable to ,ore higher order calculation. In particularly 
there will be tidal effect at the 4th order and a simple point particle limit should 
be modified. Since our limit allows us to calculate higher-order moments of the 
component star as a simple expansion with respect to e, the tidal effect should be 
calculated without difficulty. In fact it is straightforward to formally calculate all the 
effect of Newtonian mass multipole moments in the equation of motion in the surface 
integral approach. 
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d^z\ 

dr^ 



=zz 



(-l)P+H2p + 2q + l)l\i; 



<M„> 



<Na> 



p\q\ 



p+q+2 

^12 



-N<iMpNq> ( 35 ) 



where r\2 = z\ — Z 2 , Nt = r{2/^i2, Mp = mim2 . . .nip is a corrective index, 
< • • • > denotes the symmetric-tracefree operation on the indices between the brack- 
ets, and are the Newtonian mass multipole moments of order 2p. 

Unfortunately the present derivation of equation of motion for a fast moving small 
body is still on a formal level, the strong field point particle limit dose not have any 
ambiguity for the separation of the singular part in the self field, there might be some 
way of deriving more tractable expression for the equation of motion. 

Acknowledgments This review is based on my talk at 524th WE Heraeus seminar on the “Equa- 
tions of Motion in Relativistic Gravity” I would like to thank Dirk Puetzfeld for giving me a chance 
to contribute in the seminar. I would also thank Yousuke Itoh for fruitful collaborations and useful 
discussions. I apologize not to cite and mention many relevant works on the subject of equation of 
motion, and strongly recommend reader to read the review articles such as Living Reviews. This 
work is supported by a Grant-in- Aid for Scientific Research from JSPS (Nos. 18072001, 20540245). 



References 



1. A. Einstein, L. Infeld, B. Hoffmann, Annals. Math. 39, 65 (1938) 

2. S. Chandrasekhar, RP Esposito, Astrophys. J. 160, 153 (1970) 

3. V. Eock, The Theory of Space Time and Gravitation (Pergamon, Oxford, 1959) 

4. L.D. Landau, E.M. Lifshitz, The Classical Theory of Fields (Pergamon, Oxford, 1962) 

5. L. Blanchet, T. Damour, Phys. Rev. D 37, 1410 (1988) 

6. L.P Grishchuk, S.M. Kopejkin, Sov. Astron. Lett. 9, 230 (1983) 

7. A.G. Wiseman, C.M. Will, Phys. Rev. D 44, 2945 (1991) 

8. M. Walker, C.M. Will, Phys. Rev. Lett. 45, 1751 (1980) 

9. C. Konigsdorffer, G. Eaye, G. Schafer, Phys. Rev. D 68 , 044004 (2003) 

10. S. Nissanke, L. Blanchet, Class. Quantum Gravity 22, 1007 (2005) 

11. Y. Itoh, Phys. Rev. D 80, 124003 (2009) 

12. T. Damour, P Jaranowski, G. Schafer, Phys. Rev. D 63, 044021 (2001) 

13. L. Blanchet. Living Rev. Relativ, 5(3) (2002) 

14. L. Blanchet, T. Damour, G. Esposito-Earese, Phys. Rev. D 69, 124007 (2004) 

15. T. Eutamase, Phys. Rev. D 36, 321 (1987) 

16. T. Eutamase and Y Itoh. Living Rev. Relativity, 10(2) (2007) 

17. T. Mino, M. Sasaki, T Tanaka, Phys. Rev. D 55, 3457 (1997) 

18. T. Quinn, B. Wald, Phys. Rev. D 56, 3381 (1997) 

19. S. Eukumoto, T. Eutamase, Y Itoh, Prog. Theor. Phys. 116, 423 (2006) 

20. E. Poisson. Living Rev. Relativity, 7(6) (2004) 

21. A. Pound, Phys. Rev. D 81, 024023 (2010) 

22. L. Barack, A. Ori, Phys. Rev. D 61, 061502(R) (2000) 

23. L. Barack, A. Ori, Phys. Rev. D 67, 024029 (2003) 

24. L. Barack, N. Sago, Phys. Rev. D 75, 064021 (2007) 

25. A. Pound, C. Merlin, L. Barack, Phys. Rev. D 89, 024009 (2014) 

26. N. Warburton, S. Akcay, L. Barack, J.R. Gair, N. Sago, Phys. Rev. D 85, 061501(R) (2012) 

27. T. Damour, Phys. Rev. D 81, 024017 (2010) 

28. T. Eutamase, B.E. Schutz, Phys. Rev. D 28, 2363 (1983) 



326 



T. Futamase 



29. Y. Itoh, T. Futamase, H. Asada, Phys. Rev. D 63, 064038 (2001) 

30. P. Jaranowski, G. Schafer, Phys. Rev. D 55, 4712 (1997) 

31. P. Jaranowski, G. Schafer, Phys. Rev. D 60, 124003 (1999) 

32. Y. Itoh, Phys. Rev. D 69, 064018 (2004) 

33. B.S. DeWitt, R.W. Brehme, Ann. Phys. 9, 220 (1960) 



Motion in Classical Field 
Theories and the Foundations 
of the Self-force Problem 



Abraham I. Harte 



Abstract This article serves as a pedagogical introduction to the problem of motion 
in classical field theories. The primary focus is on self-interaction: How does an 
object’s own field affect its motion? General laws governing the self-force and self- 
torque are derived using simple, non-perturbative arguments. The relevant concepts 
are developed gradually by considering motion in a series of increasingly complicated 
theories. Newtonian gravity is discussed first, then Klein-Gordon theory, electromag- 
netism, and finally general relativity. Linear and angular momenta as well as centers 
of mass are defined in each of these cases. Multipole expansions for the force and 
torque are derived to all orders for arbitrarily self-interacting extended objects. These 
expansions are found to be structurally identical to the laws of motion satisfied by 
extended test bodies, except that all relevant fields are replaced by effective versions 
which exclude the self-fields in a particular sense. Regularization methods tradition- 
ally associated with self-interacting point particles arise as straightforward pertur- 
bative limits of these (more fundamental) results. Additionally, generic mechanisms 
are discussed which dynamically shift — i.e., renormalize — the apparent multipole 
moments associated with self-interacting extended bodies. Although this is primar- 
ily a synthesis of earlier work, several new results and interpretations are included 
as well. 



1 Introduction 

How are charges accelerated by electromagnetic fields? How do masses fall in curved 
spacetimes? Such questions can be answered in many different ways. Consider, for 
example, the Newtonian ^-body problem. This is typically solved using a certain 
system of ordinary differential equations which govern the locations of n points in M ? . 
Besides its location, each point is characterized only by its mass. This is a considerable 
abstraction from the stars or planets whose motion the fz-body problem is intended 
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to describe. Physically, each mass point is really an extended body described by the 
laws of continuum mechanics. The internal density distributions, velocity fields, and 
temperatures of these bodies might be governed by complicated sets of nonlinear 
partial differential equations. From one point of view, it is the solutions to these 
equations which represent “the motion” of each mass. 

This is not, however, the approach which is typically adopted in celestial mechan- 
ics. In that context, one instead focuses only on each body’s center of mass (and 
perhaps its spin angular momentum). These are observables which describe motion 
“in the large.” It is a central result of Newtonian gravity that much of the dynamics 
of these observables can be understood without detailed knowledge of each body’s 
internal structure. This is why the extended stars and associated partial differential 
equations of the “physical ^-body problem” can often be modeled as discrete points 
satisfying a simple set of ordinary differential equations — an enormous simplifica- 
tion. 

This work is intended as an introduction to techniques which have recently been 
developed 11-5] to similarly simplify problems of motion in a wide variety of con- 
texts. Is it possible, for example, to describe extended masses in general relativity 
using appropriately-defined centers of mass? Do these mass centers obey simple 
laws of motion? Of course, the same questions may also be asked for charged matter 
coupled to electromagnetic fields. In simple cases, appropriate laws of motion are 
well-known in both electromagnetism and general relativity: Sufficiently small test 
charges accelerate via the Lorentz force law and sufficiently small test masses fall 
on geodesics. Test body motion is also understood in cases where a body’s higher 
multipole moments cannot be neglected 16] . 

Although it has historically been difficult to relax the test body assumption 
in relativistic theories, several important cases have nevertheless been understood 
17-14] . The majority of this work has been intrinsically perturbative. It makes detailed 
assumptions about the systems to be studied and uses these assumptions at all stages 
in the analysis. Concepts like the mass and momentum of individual objects typi- 
cally arise as purely perturbative structures with no clear connection to the full theory. 
This review takes a different approach. Although approximations may be needed to 
understand specific applications, we adopt the point of view that approximating exact 
concepts is preferable to considering structures which emerge only as artifacts of a 
particular approximation scheme. We therefore focus on non-perturbative descrip- 
tions of motion. Somewhat surprisingly, considerable progress can be made from 
this perspective. Indeed, applying perturbation theory “too early” serves mainly to 
increase the computational burden and to obscure the underlying physics. 

The first step in our program is to define exact linear and angular momenta for 
arbitrary extended objects.^ It is these momenta which are used to characterize an 
object’s motion. Their evolution equations are derived without placing any significant 



^ Classical point particles are sometimes discussed as though they were the fundamental building 
blocks of all classical matter. This viewpoint is severely problematic on both mathematical and 
physical grounds, and is rejected here. That said, appropriately-regularized point particles do arise 
as mathematical structures obtained from certain well-defined limits involving families of extended 
bodies. All mention of point particles here is to be understood in this (effective) sense. 
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constraints on an object’s shape, composition, or degree of rigidity. Despite this 
generality, the methods used here are very easy to apply once the main concepts 
have been understood. Almost all difficulties lie in finding appropriate definitions 
and interpreting relations between those definitions; complicated calculations are not 
required.^ Nevertheless, many of the concepts used here are likely to be unfamiliar. 
Considerable effort has therefore been devoted to explaining these concepts slowly 
and carefully by applying them in a series of increasingly complicated contexts. 

The prototype for all of our discussion is Newtonian celestial mechanics. The 
laws of motion for this theory are reviewed in Sect. 2. This serves two purposes. First, 
Sect. 2. 1 uses standard techniques to remind the reader which ideas are important and 
why they are true. What, for example, is a self-field? Why do the self-force and self- 
torque vanish in Newtonian gravity? The methods used to discuss these questions 
make essential use of the vector space structure of Euclidean space, and cannot be 
generalized to curved spacetimes. 

Section 2.2 therefore uses Newtonian gravity as a familiar setting with which 
to introduce techniques that do make sense in curved spacetimes. The problem is 
reformulated such that no reference is made to any particular coordinate system or 
to the detailed properties of Euclidean space. All that is needed is a Riemannian 
space which admits a maximal set of Killing vector fields. It is then possible to 
introduce a “generalized momentum” which serves to describe a body’s large-scale 
behavior. The ordinary linear and angular momenta arise as two aspects of this more 
fundamental structure. Using the generalized momentum has many advantages and 
plays an essential role throughout this review. Employing it, Newtonian self-forces 
and self-torques are seen to vanish using a one-line computation which requires only 
the symmetry of an appropriate Green function. That symmetry is physically related 
to Newton’s third law. 

From this perspective, certain generalizations of Newtonian gravity may be con- 
sidered with almost no additional effort. The standard Euclidean background space 
can, for example, be replaced by one which is spherical or hyperbolic. The usual 
laws of motion still hold in these cases, except for the addition of Mathisson- 
Papapetrou spin-curvature couplings. Such terms arise kinematically even in these 
non-relativistic problems, and are shown to have a simple geometrical interpretation. 

Fully relativistic motion is first discussed in Sect. 3, although only in fiat or oth- 
erwise maximally- symmetric backgrounds. We consider for simplicity the motion 
of matter coupled to a linear scalar field. A non-perturbative self-field is defined in 
this context using a slight generalization of its Newtonian analog. Unlike in the non- 
relativistic case, forces and torques exerted by relativistic self-fields do not necessar- 
ily vanish. They instead act to renormalize^ an object’s linear and angular momenta. 
This effect is finite and non-perturbative. Physically, it represents the inertia associ- 
ated with an object’s self-field. Mathematically, it is related to the hyperbolicity of the 



^As is typical throughout physics, simple underlying principles do not imply simple applications 
to explicit problems. Applications typically do require significant computations. 

^It is common in the literature to use the words renormalization and regularization interchangeably, 
both implying the removal of unwanted infinities. This is not the usage here. Renormalization is 
intended in this review essentially as a synonym for “dynamical shift.” These shifts need not be 
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underlying field equations. Similar effects apply generically for all matter coupled 
to long-range hyperbolic fields. 

Section 4 considers motion in fully generic curved spacetimes. The Killing vectors 
used to define momenta in simpler cases must then be replaced by an appropriate set 
of “generalized Killing fields.” This is accomplished in Sect. 4.1. The scalar prob- 
lem of motion is analyzed first in this more general context, where a new type of 
renormalization is found to occur. This affects the quadrupole and higher multipole 
moments of a body’s stress-energy tensor, and may be viewed as a consequence of 
the “passive gravitational mass distribution” of an object’s self-field. Matter cou- 
pled to electromagnetic fields in generic background spacetimes may be understood 
similarly, and is discussed in Sect. 4.3. Finally, Sect. 4.4 considers motion in general 
relativity. 

Notation 

The sign conventions used here are those of Wald [15]. Metrics have positive signa- 
ture. The Riemann tensor satisfies 2V[^ = Rab/^d for any 1-form and the 

Ricci tensor is defined by Rab = Racb^ • Abstract indices are denoted using letters 
a,b, . . . from beginning of the Latin alphabet, while i, j, . . . represent coordinate 
components. Boldface symbols are used to denote Euclidean vectors and tensors. 
Units are chosen such that c = G = 1 . 



2 Newtonian Gravity 

Consider a Newtonian test body immersed in a gravitational potential 0(jc such 
a body is sufficiently small, it is well-known that its center of mass location 7 ^ at 
time t evolves via 



7f = -V0(7/,O- (1) 

This is not the correct equation of motion for (non- spherical) objects with significant 
self-gravity. Relaxing the test body assumption while still imposing an appropriate 
smallness condition results in 



7« = -V0(7f,O, (2) 

where <j) denotes that part of the potential which is determined only by masses external 
to the body of interest. Comparison of these two equations shows that the field V 0 



(Footnote 3 continued) 

infinite. Regularizations, by contrast, always refer to rules for handling singular behavior. Almost 
all discussion here focuses on finite renormalizations. Regularizations arise only in certain limiting 



cases. 
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which accelerates a large mass can differ"^ from the field V which would be inferred 
by measuring the accelerations of nearby test particles. Although it is not often 
emphasized, this is a standard result in Newtonian gravity. The well-known laws of 
motion which describe “Newtonian point masses” are, for example, equivalent to 
(2), not (1). 

A central goal of this review is to explain how similar results hold in more compli- 
cated relativistic theories. In all cases, the laws of motion are structurally identical to 
those associated with test bodies. The fields which appear in those laws of motion are 
not, however, the physical ones. Effective fields appear instead, their details depend- 
ing on appropriate notions of self-interaction. Once the precise nature of the effective 
field has been determined in a particular theory, “point particle limits” and related 
approximations follow very easily. 

Many of the difficulties encountered in the relativistic theory motion already 
appear in the Newtonian problem (where they can be so simple as to easily pass by 
unnoticed). It is therefore instructive to open this review by carefully discussing the 
Newtonian theory of motion. Section 2. 1 accomplishes this using essentially standard 
arguments. Concepts such as the self-field and self-force are emphasized, as well as 
their connections to physical principles like Newton’s third law. Similar discussions 
may be found in, e.g., [14, 16, 17]. Unfortunately, the familiar techniques used in 
Sect. 2. 1 cannot be readily applied to more complicated theories. Section 2.2 therefore 
uses Newtonian gravity as a familiar setting with which to introduce a different, more 
geometrical, approach. The resulting formulation, some of which originally appeared 
in [2, 5], does generalize. It is used throughout the remainder of this review. 



2.1 Newtonian Celestial Mechanics 

Consider an extended body residing inside a finite (and possibly time-dependent) 
region of space 53^ C which contains no other matter. The mass density p and 
momentum density^ pv of this body are both assumed to be smooth. Local mass and 
momentum conservation then imply that [17-19] 

dp 

^ + V . (pv) = 0 (3) 



^Although the gradients of 4> and <j) coincide at the center of a spherically- symmetric mass, they can 
be quite different in general. Consider, for example, a barbell constructed by joining two unequal 
spheres with a massless strut. 

^In simple cases, v represents a velocity field in the standard sense. More generally, it might 
be only an effective construction. This occurs, for example, if a body is composed of multiple 
interpenetrating fiuids. 
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and 




( 4 ) 



The Cauchy stress tensor r describes how matter interacts via contact forces, while 
the force density / describes longer-range interactions (also known as body forces). 
If the only long-range forces are gravitational, there exists a potential 0 such that^ 



The influence of masses external to 53^ may be encoded using, e.g., cj) or its normal 
derivative on the boundary 903^. 

Equations (3)-(6) are very general. They are not, however, complete. Imposing 
appropriate boundary conditions, Poisson’s equation determines 0 in terms of p, 
mass conservation evolves p using v, and momentum conservation evolves v using 
T. The stress tensor cannot, however, be determined without additional assumptions. 
Its evolution is not universal. Stresses depend on an object’s detailed composition, 
reflecting the trivial fact that different types of materials move differently. This is 
but a minor obstacle in celestial mechanics, and no particular form for r is assumed 
here. 

Observables which describe a body’s “large-scale” motion may be obtained by 
integrating the conservation laws (3) and (4). This results in the total mass m, linear 
momentum p{t), and angular momentum S{zt, t)\ 

m := pvd^jc, ^ '= p(^ ~ Zt) x vd^x. (7) 



The general philosophy of celestial mechanics is to focus on p and S while ignoring 
p and V as much as possible. The vast majority of information concerning an object’s 
internal structure is set aside; only its momenta matter. Evolution equations for these 
momenta are easily obtained from (3) and (4), which show that the mass remains 
constant and that 



/ = -P^4>- 



( 5 ) 



Inside 53^, the potential must satisfy Poisson’s equation 

= 47rp. 



( 6 ) 




P = - 



[ pWcpd- 



l^x, S = - 






p(x - Zt) X - it ^ P- (8) 



^This follows from noting that any sufficiently small piece of matter with finite density responds to 
gravitational forces as though it were a test body. 
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The gravitational force and torque acting on an extended body therefore depend on 
its mass distribution, its internal gravitational potential, and a “choice of origin” 
parametrized by Zt • 

One reason for considering p is its close relation to the center of mass position 
7 ^ . This can be defined by 



7 r •= — [ xp(x, t)d^x, (9) 

tn 

or equivalently by demanding that a body’s mass dipole moment vanish when eval- 
uated about 7 j : 



/ (x — 'jf)p(x, t)d^x = 0. ( 10 ) 

Regardless, it follows from (3) and (9) that the center of mass velocity must satisfy 

m^t = p. ( 11 ) 

Note that this is a derived result, not a definition; m, 7 ^ and p are defined in terms 
of p and V via (7) and (9). In more complicated theories, the center of mass velocity 
need not be parallel to the momentum. 

Once 7 ^ has been defined, its time evolution is easily found by combining ( 8 ) and 
( 11 ) to yield 



= — pW(j)d^x. ( 12 ) 

Evaluating S(zt , t) with Zt = 7 ^ isolates the spin component of the angular momen- 
tum from the orbital component which can appear more generally, resulting in 

S = - [ / 0 (x - 7 j) X V(j)d^x. (13) 

is, 

In astrophysical applications, p and V 0 are typically only very coarsely constrained 
by observations. Integral expressions like (12) and (13) are therefore unsuitable for 
applications. They must first be simplified. 

Such simplifications are immediate if V0 varies negligibly throughout 93^, as can 
occur if the mass in question is a test body whose dimensions are small compared with 
the distances to all other masses in the universe. In these cases, it follows from (12) 
and (13) that the center of mass acceleration satisfies (1) and that S = 0. Simplifying 
the laws of motion in more general contexts requires understanding the influence of 
an object’s own gravitational field. 
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A precise definition for the self-field may be obtained via a two-point function 
(or propagator) G(x,x') describing “the gravitational potential at x per unit mass 
at Any potential constructed from such a propagator can reasonably be called a 
self-field only if it is a Green function for the Poisson equation: 

V^G{x,x') =4 ttS^(x -x'). (14) 

There are, of course, many possible Green functions. A particular one may be singled 
out by demanding that self-fields described by G be compatible with Newton’s third 
law. Consider two distinct points x,x' e . It is then natural to interpret “the force 
on mass at x due to mass at jc'” to mean 

— p(x, t)W ^G(x, x')p(x', Od^jc'j d^jc. (15) 

The weak form of Newton’s third law states that the force at jc due to x' must be 
equal and opposite to the force at x' due to jc, which implies that 

VG(jc, x') = -y'G(x\ x). (16) 

The strong form of Newton’s third law instead requires that the force at jc due to jc' 
point along the line which connects these two points. Imposing this, 

VG(x,Jc') a JC -x'. (17) 

Any G(jc, jc') which is compatible with the strong form of Newton’s third law can 
therefore depend only on the distance |jc — jc'| between its arguments. Up to an 
irrelevant additive constant, it follows from (14) that 



G(jc, jc') = G(jc', jc) = . 

|x -x'l 



( 18 ) 



The total self-field is then 

(f>s(x,t):= [ p(x' , t)G(x, x^)d^x' = — [ d^jc'. (19) 

The physical field 0 may be viewed as the sum of the self-field (^s and an appropriate 
remainder 



( 20 ) 

It follows from (6), (14), and (19) that the effective potential 0 satisfies the vacuum 
field equation = 0 throughout 93^. 
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Now consider the total force exerted by (j)s, the “self-force.” Noting (8), it is 
natural to let this refer to 



Fs :=- [ P^(psd^x. (21) 

Substituting (19) into this expression results in an integral over the product space 
93, X 93,: 



Fs = — p(x, t)p(x' , t)WG(x, x^)d^xd^x' . (22) 

Recalling (16) or (18), the integrand is antisymmetric under interchange of x and 
x\ The Newtonian self-force therefore vanishes. This is an exact result. It holds for 
all compact mass distributions. Whatever the shape a particular body happens to be 
in 93,, the self-force vanishes because that shape is trivially symmetric when copied 
into 93, X 93,. A similar argument may be used to show that the self-torque, the net 
torque exerted by cps, vanishes as well. 

The main point of this discussion is that the net gravitational force exerted on any 
isolated extended mass satisfies 

P = - [ = - [ pW^d^x. (23) 

Not necessarily choosing z, to be the center of mass, the equivalent evolution equation 
for 5(z,, t) is 



S = - [ p(x - Zt) X -Zt X p. (24) 

The vanishing self-force and self-torque allows 0 to be replaced by 0 in the evolution 
equations for both the linear and angular momenta. Although forces and torques may 
be computed using either the physical field 0 or the fictitious effective field 0, the 
latter computation is often simpler. In most cases of practical interest, V0 varies 
far more slowly in 93, than does V0. The integral involving V0 can therefore be 
amenable to approximation when the (otherwise equivalent) integral involving V 0 
is not. 

Recalling that (p is harmonic inside the body region, it must be analytic there. 
This means that its Taylor series about an arbitrary point z, e 93, converges at least 
in some neighborhood of z,. If that series converges throughout the body, it may 
be substituted into (23) and integrated term by term. The integral expression for the 
force is then equivalent to 



oo 

Pi{t) = -mdi^izt, 0 - ^ 

n=\ 



n\ 



(Zt,t)didj^ ■■■dj^$(zt,t), 



(25) 
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where (zr , t) denotes the body’s 2 '^-pole mass moment about Zr : 

t) :=[ {x - ZtV^ ■■■{x - ZiV'^pix, r)d^x. (26) 

J% 

The series (25) is referred to as a multipole expansion for the force. A similar series 
also exists for the angular momentum. If Zt is chosen to coincide with the center 
of mass 7 j, the dipole moment ( 7 ^, t) vanishes by (10). The n = I term in (25) 
therefore vanishes as well, so 



t 1 • • 

7 J = t) V —w'‘ ■■■■'" ( 7 /, t). (27) 

m n\ 

n=2 

The utility of this expression is that there are many cases of interest where the 
multipole series can be truncated at low order without significant loss of accuracy. 
The simplest such truncation recovers (2). More generally, there are correction to 
this equation which involve a body’s quadrupole and higher multipole moments. 

Lastly, note that the moments (26) are somewhat different from the ones which are 
found in textbooks. The harmonicity of (j) implies that arbitrary traces may be added 
to the without affecting the force. The appearing in (25) may therefore 

be replaced by different moments which are trace-free in all pairs of indices. 

It is these trace-free moments which are typically used in practical calculations. 
Besides the elimination of irrelevant components, the trace-free moments are also 
useful in that they may be determined purely using external measurements of an 
object’s gravitational field. 



2.2 Reformulating Newtonian Celestial Mechanics 

The discussion which has just been presented relies heavily on the geometric pecu- 
liarities of Euclidean space. This is not essential. The only characteristic of (three- 
dimensional) Euclidean space which is truly important is that it is maximally symmet- 
ric: There exist a total of six linearly independent Killing vector fields. The Newtonian 
laws of motion are now rederived using methods which make this manifest. 

As a consequence, certain aspects of the Newtonian problem are significantly 
clarified. The geometrical nature of the linear and angular momenta is made precise, 
for example. These are shown to be two aspects of a more fundamental vector which 
lives not in the physical space, but in a space which is dual to the space of Killing 
vector fields. The approach introduced in this section also emphasizes the impor- 
tance of symmetries. It is fundamental to understanding motion in more complicated 
theories. 
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Another advantage of the reformulation discussed in this section is that certain 
generalizations of Newtonian gravity may be understood essentially “for free.” Not- 
ing that spherical and hyperbolic spaces are both maximally- symmetric, there are 
no new complications if the usual Euclidean background of Newtonian gravity is 
replaced by a space of constant curvature. It is also trivial to change the number of 
spatial dimensions, or to add, e.g., a mass term to the field equation. For concreteness, 
we restrict to three spatial dimensions and keep the gravitational field equation as-is. 
We do, however, allow the background space to be curved. This has interesting con- 
sequences which reappear in the more complicated relativistic theories considered 
in later sections. 



2.2.1 Geometric Preliminaries 

The locations of Newtonian events may be viewed as points in a four-dimensional 
manifold A4. While a relativistic spacetime is defined using only a manifold and a 
non-degenerate metric, Newtonian spacetimes require more structure [18-20]. One 
such structure is a preferred notion of time. This takes the form of an equivalence 
class^ of functions which associate each event in spacetime with “the time” at which 
it occurs. Associated with this is a preferred foliation of A4 into a one-parameter 
family of hypersurfaces {St}. These are the spaces of constant time. 

Newtonian spacetimes are difficult to work with directly. They simplify consider- 
ably in the presence of a frame, a structure that identifies events at different times as 
being at “the same” spatial point. It is assumed here that a frame has been fixed in such 
a way that all St are mapped into a single space consisting of a three-dimensional 
manifold S together with a (fixed) Riemannian metric gab- This process also fixes a 
particular time function. It permits all physical quantities in spacetime to be viewed 
as time-dependent quantities on S. We allow the spatial metric to be curved, but 
assume that its curvature is everywhere constant. Letting and Rabc^ denote the 
covariant derivative and Riemann tensor associated with gab, ^a^bcd^ = 0- This 
implies that (S, gab) is maximally symmetric. 

Consider the motion of a material object instantaneously confined to a subman- 
ifold C 5 which contains no other matter and has finite volume. Denote this 
body’s mass density at time t by p(-, t) and its velocity field at time t by n^(-, t). 
Local conservation of mass and momentum continue to hold in this context, so (3) 
and (4) carry over essentially without change: 



^For any single time function T : At M and any c,d eR such that c > 0, the map cT + d is 
also an acceptable time function. 




(28) 



(29) 
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The gravitational potential 0 which appears here satisfies the obvious generalization 
of Poisson’s equation: 



V“Va</. = g‘^^VaVb4> = 47rp. 



(30) 



2.2.2 Generalized Momentum 

Our first significant departure from the elementary discussion of Newtonian motion 
found in Sect. 2.1 arises in the definitions for a body’s linear and angular momenta. 
The usual integrals (7) make sense only when evaluated in a Cartesian coordinate 
system. Alternatively, they require a canonical identification of tangent spaces asso- 
ciated with different points in the spatial manifold. While this is easily accomplished 
in Euclidean space, it is not obvious what to do more generally. Our first task is 
therefore to define momenta which do not make reference to a specific coordinate 
system. Accomplishing this provides a notion of momentum which is easily gener- 
alized to curved Newtonian backgrounds, and even to completely generic relativistic 
spacetimes. It is a basic building block for all results discussed in this review. 

One problem with elementary definitions of mechanical momentum is that they 
attempt to represent this concept via a spatial vector or covector. This is physically 
unnatural (except for point particles or momentum densities). Momenta are asso- 
ciated with extended regions, not individual points. There is no natural tangent or 
cotangent space in which to place the momentum contained in an extended region 

C 53^. The simplest mathematical structure with which to represent a quasi-local 
quantity must itself be quasi-local. Spatial tensors are not, of course, examples of 
such structures. 

Besides being quasi-local, momenta must also be extensive. For any two disjoint 
regions 93 1 , 912 C 93^ which are “physically independent,” there must be a sense in 
which 

(momentum in 93i) + (momentum in 932) = (momentum in 93i U 932) (31) 

for some binary operation “-h” which is both associative and commutative. If 93i and 
932 are identically prepared, it is also natural to suppose that 

(momentum in 93i) + (momentum in 932) = 2(momentum in 93i). (32) 

This motivates a notion of scalar multiplication. 

Together, these considerations and others suggest that momenta should be ele- 
ments of a vector space. The most natural vector space is not, however, the space of 
tensors at any particular spatial point. A better choice may be motivated by recall- 
ing that conserved linear momenta arise naturally in theories which are derived 
from translation-invariant Lagrangians. Similarly, conserved angular momenta arise 
from Lagrangians which are invariant with respect to rotations. This suggests that 
both types of momenta can be associated explicitly with a collection of continuous 
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symmetries. Consider, in particular, those symmetries — the continuous isometries — 
which preserve the spatial metric. While these are not necessarily symmetries for all 
physically-interesting quantities, they are extremely useful. 

The continuous isometries of a Riemannian space (S, gab) are generated by its 
Killing vector fields. By definition. 



= 0 (33) 

for every Killing vector , where denotes the Lie derivative with respect to . 
We use K to denote the collection of all Killing vector fields together with obvious 
notions of addition and scalar multiplication. This is a vector space. Moreover, the 
dimension of this vector space is finite. If the dimension of the physical space is 
dim 5 = dim 93^ = N, it may be shown that (see, e.g.. Appendix C of [15]) 

1 

dimK < -A(A-h 1). (34) 

This section restricts attention to maximally- symmetric spaces where dimK = 
^N{N + 1). When N = 3, Euclidean, spherical, and hyperbolic spaces are all 
maximally- symmetric. They admit six linearly-independent Killing fields. Given a 
preferred point, three Killing fields may be interpreted as translations and three as 
rotations. This makes sense only near the given point, and is best avoided at this 
stage. Doing so implies that the linear and angular momenta should be treated as 
elements of a single object “conjugate to” the space of all Killing vector fields. 

Consider a representation of a body’s momentum as a vector in the space K* 
which is dual to K. An element of K* is, by definition, a linear map from K to M. 
The specific linear map which has the desired properties is 

Ptimo- [ p(x,t)va(x,t)eMdV, (35) 

where e K and the volume element is the natural one associated with gab- We call 
this the generalized momentum contained in 93 c 93^ < 2 ^ time L It is often convenient 
to omit the dependence on 93, in which case it is to be understood that Pf = R? [93^]. 

The dimension of K* is equal to the dimension of K, so this momentum has six 
components in three spatial dimensions. These components correspond to the usual 
three components of linear momentum and three components of angular momentum. 
Such a split can be made explicit by introducing additional structure, namely a 
preferred point Zt ^ 93^. For any such point, R^[93] can be re-expressed in terms 
of spatial tensors pa, at zt- This is explained in Sect. 2.2.6. For now, it suffices 
to consider Rr[93] on its own. While the introduction of a preferred point allows 
this map to be replaced by spatial tensors, avoiding such representations whenever 
possible provides considerable calculational and conceptual simplifications. 
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In relativistic contexts where there exists a maximally-symmetric background 
geometry, the generalized momentum remains essentially unchanged. The infinites- 
imal momentum pVadV is merely replaced by Ta^dSt,, where is an appropriate 
stress-energy tensor and dSb is the natural volume element on a three-dimensional 
hypersurface. If a spacetime is not maximally-symmetric, one also replaces K by 
another vector space which has the correct dimensionality. The “generalized Killing 
fields” used for this purpose are discussed in Sect. 4.1. 



2.2.3 Generalized Force 



How does Newtonian gravity affect the time evolution of the generalized momentum? 
Using local momentum conservation (29) and assuming that the boundary is 
independent of time (or that there is no matter there). 







T‘'^)C^9ab 



dV = - 



[ pC^<pdV. 



(36) 



The second equality here follows from Killing’s equation (33). If Cpcj) = 0 for 
some specific Killing field it is clear that the associated momentum Pt[fR]('ip) is 
conserved. This means that if (p is constant along a translational Killing field, there 
can be no force in that direction. Similarly, a field which is invariant about rotations 
around a given axis exerts no torque about that axis. Both of these statements are 
physically obvious. They are also of limited value. Once the field equation (30) is 
taken into account, Cp(p = 0 implies that Cpp a CpV^Va(p = = 0 as 

well. This is clearly impossible for any compact body if is a pure translation. 
Rotational symmetries fare better, although they are still a rather special case. 

Transforming (36) into a surface integral results in a more interesting conservation 
law. Using the field equation and integrating by parts shows that 

= T\(!>dSa, ( 37 ) 

d? /9JH 

where 

Tab '■= T(Va(/)V^^ - \gab'^'' (t>'^ c4>) (38) 

47T 2 

is the stress tensor associated with (p. At least in fiat space, one might imagine 
extending dR (and perhaps d^t) far outside of all matter of interest. If <p falls off 
sufficiently fast in this region, the surface integral can be seen to vanish. The gen- 
eralized momentum is therefore conserved in such cases. Of course, the momentum 
associated with a single object in a larger system is not conserved. Understanding its 
dynamics requires a different argument. 
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2.2.4 The Self-field 

The generalized force (36) involves the physical field <p. As discussed in Sect. 2. 1 , this 
is too complicated to work with directly. We therefore isolate its most complicated 
part — the self-field — and compute what it does directly. Once this is accomplished, 
the remaining undetermined portion of the force is relatively simple to understand. 

The self-field in this context is defined in Sect. 2.1 in terms of a certain two-point 
function G. This must still be a Green function. If G(x, x') = G(x\ x), the two 
constraints (16), (17) which implied a notion of Newton’s third law in Euclidean 
space generalize to the statement that 



for all G In the Euclidean case, translational invariance alone implies the 
weak form of Newton’s third law. Eurther imposing rotational invariance recovers 
the strong form of Newton’s third law. In general, though, symmetries of G imply 
only “portions of” Newton’s third law. 

It is always possible to find Green functions which satisfy (39) in maximally- 
symmetric backgrounds. Indeed, these Green functions depend only on the geodesic 
distance between their arguments. Introducing Synge’s function (also known as the 
world function) [10, 21, 22] 



the Euclidean Green function (18) can be written as G = — 1/V^. Green func- 
tions associated with spherical and hyperbolic spaces are merely more complicated 
functions of a [23]. In any of these cases, C^G oc = 0. 

Using the symmetric Green function which satisfies (39) to define the self-field, 
let 



£^G(x, x') = + e\x')Va']G{x, x') = 0 



(39) 





(41) 



Substituting this into (36) shows that 




- [ dV [ dV'p(x,t)p(x\t)jC,G(x,x') 
2 703 , J^t 



/ p(x, t)C^^(x, t)dV, 



(42) 



342 



A.L Harte 



where cj) = cj) — cj)^ and 9^ has been replaced by the entire body region . It is clear 
from this that the self-force and self-torque both vanish as an immediate consequence 
of (39). All forces and torques may therefore be computed using 0 instead of 0. 
Furthermore, the effective field satisfies the vacuum equation 

V«V«0 = O. (43) 

It can clearly be computed by subtracting the self-field from the physical field. Alter- 
natively, Stokes’ theorem may be used together with (43) to write 0 as a kind of 
average of 0 over a closed surface which surrounds the body of interest. 

It has already been mentioned that Pf{^) is conserved if C^cj) = 0. Equation (42) 
shows that this also true if = 0, a much weaker condition. For a closed system, 
one typically has 0 = 0s and hence 0 = 0. All components of the generalized 
momentum are therefore conserved in such cases. 

Equation (42) has been established by showing that the generalized force exerted 
by 0s always vanishes. This force involves an integral over fSt x 93^, and may 
therefore be interpreted as a two-point interaction. It can sometimes be interesting 
to also consider interactions between three or more points. Let 

^max p p 

^s(xj) :='^C„ dVi - dV„p{yi,t)---p(yn,t)Gn(x,yi,...,y„), (44) 

where the Cn are arbitrary constants and the (n -h 1) -point propagators Gn are symmet- 
ric in their arguments and satisfy C^Gn for all G It is straightforward to show 
that the generalized force exerted by any such field vanishes. Given the two-point G 
used to define 0s, an appropriate three-point interaction may be chosen using, e.g., 

Gs(x, y, z) = G(x, y)G(y, z)G(z, x). (45) 

Other choices are also possible, of course. Higher-order propagators typically lead to 
fields 0s which are not really Newtonian self-fields in the sense that 0s ^irp. 
Series like (44) can nevertheless be useful for understanding different theories where 
matter couples to nonlinear fields. In those cases, the sum in 0s might be compared 
to a kind of Dyson series for an object’s self-field. Regardless of the field equation, 
however, the existence of a Killing field which satisfies G^(0 — 0s) = 0 for some 
0s always implies that Pt (0) is conserved. Although this conservation law might be 
manifest only for a particular choice of 0s, the value of Pt(ip) does not depend on 
that choice. 
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2.2.5 Multipole Expansions 



Returning to the main development, note that (36) and (42) differ only by the replace- 
ment 0^0. Although both of these integrals are numerically equivalent, the lat- 
ter is often simpler to evaluate. This is because can be readily approximated 
throughout 53^ in many more physically-interesting situations than can Such 
approximations are based on a Taylor expansion of 0. While this has an obvious 
meaning in Euclidean space, a technical diversion is needed to explain what is meant 
by Taylor expansions more generally. 

Given an origin Zt ^ about which a particular Taylor expansion is to be per- 
formed, the most natural Cartesian-like coordinate systems are the Riemann normal 
coordinates with origin Zt- These are unique up to rotations, and may be used to 
perform Taylor expansions in the usual way. 

To be more precise, recall that the exponential map exp^ = x' takes as input 
a point V and a vector at that point. The point x' which is returned is found by 
considering an affinely-parametrized geodesic yu satisfying yo = -^ and = X^. 
The point x' is then equal to yi. An equivalent statement may be expressed using 
Synge’s function (40). Letting (Ta(x\ x) denote V^cr(x^ x), 

exp^[— cr^(v^ x)] = x'. (46) 



First derivatives of Synge’s function therefore generalize the concept of a “separation 
vector.” The jc' — jc of a conventional Taylor series in Cartesian coordinates naturally 
turns into —a^(x',x)in more general contexts. If a scalar field A(v) is to be expanded 
in a Taylor series about some v, it is convenient to first rewrite this as a function on 
the tangent bundle by defining 

A(x,X^) := X(Qxp^X^). (47) 



Now let the ^th tensor extension of A at v be 



b\ 1 



•= 



■ d^A(x,X^) 

• • • dX^n 



. x^=o 



(48) 



This is the unique tensor field which reduces to n partial derivatives of A in a Riemann 
normal coordinate system with origin v. In fiat space, X^ai-an = ^ai • * * A. More 
generally, the curvature can appear. Further discussion of tensor extensions may be 
found in [4, 6]. 

Combining all of these concepts, a natural Taylor series for 0 which applies 
regardless of the background geometry is 



}{x\ 0 = X! j— Zt)--- Zt)^,ai-aniZt. 0- 

n\ 

n=0 



(49) 
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All distances are assumed to be sufficiently small that a remains single- valued and its 
derivative is well-defined. Furthermore, a Taylor series like this is — even if it does 
not converge everywhere of interest — assumed to be at least a useful asymptotic 
approximation throughout 53^ . Substituting (49) into (42) and integrating term-by- 
term results in a multipole expansion for the generalized force. Noting that 

= 0 (50) 

for any Killing field , the multipole expansion for the generalized force is 
d 1 

:rPtiO = - (51) 

at n\ 

n=a 

The mass moments which appear here depend on p via 

:= (-1)" / a‘'‘(V,Zf)---o-“"(V,Zf)/9(x',0dV. (52) 

It follows from (28) that the zeroth moment, the mass, is independent of time. Conser- 
vation laws do not, however, fix the evolution of the higher moments. These depend 
on the type of matter under consideration. 

If £^0 = 0 for some Killing field it follows that Cp^^av-an = 0 foi* 

The conservation of Pt ('ip) in such a case is therefore preserved by any approximation 
which truncates the multipole series at finite n. This is an important property which 
helps such approximations have accurate long-time behavior. 



2.2.6 Linear and Angular Momenta 

Thus far, Pj = Pt [93^] has been loosely described as equivalent to a body’s linear and 
angular momenta at time t. Similarly, time derivatives of the generalized momentum 
have been interpreted as “forces and torques.” These identifications are now made 
precise. 

Recall that the generalized momentum is a vector in K*, the vector space dual to 
K. While it is productive to view Pt simply as a linear map from K to M, it can also 
be useful to find its components with respect to a particular basis. It is in this context 
that the linear and angular momenta arise in their more familiar form. 

A basis for K may be found by recalling that knowledge of a Killing field and its 
first derivative at any one point fixes it everywhere [15]. Choosing an arbitrary point 
V, the space of Killing vectors is in one-to-one correspondence with the space of 
all 1- and 2-forms at v. There exist two-point tensor fields x), ^^(x\ x) 

such that 



p(x') = 3‘^'‘^(x',x)Aa + 3'^''^\x',x)Bab 



(53) 
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is an element of K for any Aa and any Bab = - The “Killing data” satisfies 

^ab — ^[ab] — ( 54 ) 



In a physical space of dimension N, there exist N linearly independent 1 -forms 
and N(N — l)/2 linearly independent 2-forms. Together, these generate the requi- 
site N(N l)/2 linearly independent Killing vectors. In Euclidean space and in a 
Cartesian coordinate system, 



x) = S"', x) = ix' - xfSjV. (55) 



More generally, “ and are related to the geodesic deviation equation and 
form a basis for K. They can be computed using the first two derivatives of Synge’s 
function [24]. Defining aab ■= '^b<Xa = (Jaa’ ■= ^a'<Xa, and H“'a := 



r^a'a Tja' ^ba r^a'ab zjci'[a^b] 

= H b^ 9 ^ = H (T . 



(56) 



Substituting (53) into (35) shows that Pt (0 can be written as a linear combination 
of and V^^^(x). The coefficients in this combination are identified with the 
linear momentum p^(x, t) and the angular momentum bivector t)\ 

PtiO = t)Ux) + \s‘‘‘’(x, tWa^bix). (57) 

This is an implicit definition. Varying amongst all possible and recovers the 
explicit formulae 



p^(x,t)= / p(x' , t)Va'(x' , t)H^ b(^\ (58) 

S“'’(x,0=2/ p(x\t)Va'ix\t)H°'^‘^(x',x)a‘’\x\x)dV'. (59) 

is, 

In three spatial dimensions, the angular momentum bivector is dual to an angular 
momentum 1-form Sa via 



5a = ^€abcS‘’P (60) 

Introducing Cartesian coordinates in a fiat background, it is easily verified that the p^ 
and Si derived from Pt in this way reproduce the elementary definitions (7). Explicit 
coordinate expressions are more difficult to obtain in curved backgrounds, but these 
are rarely necessary. 



346 



A.L Harte 



Thus far, the spatial curvature has played no explicit role in any of our discussion. 
It does appear, however, in the evolution equations for and . First note the 
general identity [15] 



— ^bca 



(61) 



which holds for any Killing field Time derivatives of the Killing data (Aa, Bab) 
along a path Zt therefore satisfy 



D 
dr^ 



a — Zf Bjya , Bab 



D 



Babe ZfA(i. 



(62) 



These are known as the Killing transport equations [15, 25]. They are ordinary 
differential equations which can be used to relate Killing data at one point to Killing 
data at another point. 

Consider linear and angular momenta defined about some Zt, so, e.g., = 

p^izt, t). Substituting (54) and (62) into (57) then shows that 






-RhefS^^zf 






V.6 = ^/riO 



(63) 



for all e K. Varying over all Killing vector fields recovers the individual evolution 
equations 



+ F\ 5"'’ = (64) 

The force and torque appearing here are determined by match- 

ing appropriate coefficients in dPt/dt. Integral expressions follow from (42), (53), 
and (56): 



Fa = - [ pH^'’’<JabVa'kV, (65) 

Nab = -2 / (66) 

These expressions are exact. Their multipole expansions follow from (51): 

00 ^ 

Fa = -Y (67) 

OO rs 

n=0 



( 68 ) 
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Note that the velocity zf of the (arbitrarily-chosen) origin does not appear in Fa 
or Those portions of (64) which do involve the velocity are spatial analogs of 
the Mathis son-Papapetrou terms typically used to describe the motion of spinning 
particles in general relativity. It is apparent here that similar terms arise even in 
non-relativistic theories. Their origin is essentially kinematic, being related to the 
decomposition of K into pure translations and pure rotations. It follows from (57) 
that Paizt, t) is associated with Killing vectors which appear translational at Zt in 
the sense that V^^^(z^) = 0. Similarly, Sabizt, t) is associated with Killing fields 
which are purely rotational in the sense that ^a{zt) = 0. The Mathisson-Papapetrou 
terms arise in the laws of motion because, e.g., a Killing vector which is purely 
translational at Zt is not necessarily purely translational at a neighboring point Zt+dt- 
A given Killing field may have different proportions of “translation” and “rotation” 
at different points, and this inevitably mixes the evolution equations for and . 
A simple version of this effect occurs even in fiat space, where a pure rotation about 
one origin is not necessarily a pure rotation about another origin. This explains the 
p^^z^^ term in (64) and the —Zt^p term in (24). 

Essentially the same explanation for the Mathisson-Papapetrou terms applies in 
general relativity. In that case, the spacetime may not admit any Killing vectors at all. 
Regardless, there still exists a ten-dimensional space of “generalized Killing fields” 
as described in Sect. 4.1. Given a particular event, these are naturally decomposed 
into a four-dimensional space of translations and a six-dimensional space of rota- 
tions and boosts. Whether or not a particular generalized Killing field is, e.g., purely 
translational varies from point to point just as it does for ordinary Killing fields. The 
evolution equations for relativistic momenta therefore acquire velocity-dependent 
terms which are closely analogous to those which appear in the generalized New- 
tonian theory discussed here. 

Confining attention only to the generalized momentum whenever possible avoids 
the complications associated with the Mathisson-Papapetrou terms. It also simplifies 
the discussion of conservation laws. Recall that the presence of a particular spatial 
Killing field which satisfies £^0 = 0 implies that Pti'ip) must be conserved. 
It follows from (57) that a particular linear combination of pa and Sab must be 
conserved as well: 



This constant is independent of Zt • Its existence implies that a particular combination 
of forces and torques must vanish. Specifically, comparison with (63) shows that 




(69) 



= 0 . 



(70) 



Although these results could be deduced directly from (64)-(66), they are consider- 
ably more clear from the perspective of the generalized momentum and its evolution 
equation (42). 
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2.2.7 Center of Mass 

The laws of motion for and have left Zt undetermined. One convenient choice 
is to set Zt = "ft ^ where 7 ^ denotes the body’s center of mass at time t. 

This is straightforward when the background space is flat. It is then standard to 
define the center of mass to be the origin about which the mass dipole moment van- 
ishes: m^(jt, t) = 0. Enforcing this while differentiating (52) recovers the standard 
relation = myf between an object’s velocity and its linear momentum. Using 
(64) and (67), 



This is equivalent to (27). 

Similar results do not appear to hold when the background space is curved. It is 
still possible to demand that the dipole moment vanish [which, among other benefits, 
eliminates the n = 1 term in (67) and the n = 0 term in ( 68 )]. The velocity of such 
a trajectory may be shown to satisfy 



The integral on the left-hand side of this equation can (typically) be inverted to yield 
an explicit expression for 7 ^ . Unfortunately, the result does not depend on p^ in any 
simple way. Simplifications are possible when a body’s dimensions are much smaller 
than the curvature scale. In these cases and a' can be expanded in Taylor series 
about 7 ^, yielding the ordinary momentum- velocity relation at lowest order. More 
generally, this is problematic. Higher-order corrections require more information 
about the body than is required for the evolution equations of the momenta alone. 
Moments of a body’s momentum distribution — its “current moments” — are required 
together with its mass moments. 

It is only in this very last step where a celestial mechanics of “curved New- 
tonian gravity” appears to be problematic. Similar complications do not arise in rel- 
ativistic systems. Among other differences, the presence of boost-type Killing fields 
(or their generalizations) provide additional constraints which imply well-behaved 
momentum- velocity relations. 

2.2.8 Equations of Motion 

Results such as (64) are properly described as laws of motion, not equations of motion 
[26, 27]. They are incomplete. Even if Zt is chosen as a body’s center of mass, we 
still have not described how to compute (p or the higher multipole moments. 




n\ 



7 / p{x',t)a\(x\jt)dV = - [ p(x\t)v‘‘'(x',t)a\>(x\-f,)dV'. (72) 
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The traditional approach is to introduce smallness assumptions. Consider, for sim- 
plicity, the ^-body problem in flat space. If a particular body in such a system has 
characteristic size I and mass m, its 2'^-pole moments must be smaller^ than approxi- 
mately Letting r denote a minimum distance between bodies and assuming that 
all masses are comparable, the n\h term in (71) is at most of order j {t j vY . 
Considerable simplifications therefore result if £ ^ r. At lowest order, only the 
monopole term is retained in the law of motion. Furthermore, each 0 may be com- 
puted in this approximation by assuming that all other masses are pure monopoles. 
This recovers the typical Newtonian ^-body equations of motion. More details may 
be found in, e.g., [16, 17]. 



3 An Introduction to Relativistic Motion 

Despite being considerably more abstract than the traditional presentation of New- 
tonian gravity, the formalism which has just been described is very powerful. It does 
not rely on any particular coordinate systems, and the majority of the discussion 
doesn’t even require that the metric be fiat. Indeed, most of the results well-known in 
ordinary Newtonian gravity continue to hold in generalizations of this theory which 
employ spherical or hyperboloidal geometries. It is also trivial to change the num- 
ber of spatial dimensions, or even to amend the field equation in certain ways. It is 
physically more interesting, however, to consider motion in relativistic theories such 
as electromagnetism or general relativity. 

This section describes how the formalism of Sect. 2.2 generalizes for objects 
coupled to relativistic fields. For simplicity, we consider the motion of an extended 
mass coupled to a scalar field 0 which satisfies the Klein-Gordon equation 

(V'^V«-/i2)<^ = 47Tp. (73) 

/i represents a (constant) field mass and p the body’s charge density. Following the 
Newtonian problem as closely as possible, the four-dimensional background space- 
time (M,gab) is assumed to be maximally- symmetric. Understanding motion in 
more general curved spacetimes requires eliminating our reliance on a maximal set 
of Killing vector fields. This is indeed possible, but is somewhat technical. Its dis- 
cussion is therefore delayed to Sect. 4. Motion in electromagnetic fields is discussed 
there as well. 

Scalar charges in maximally- symmetric spacetimes provide a simple example 
with which to introduce the relativistic theory of motion. They differ from their New- 
tonian counterparts in only one important respect: Self-forces no longer vanish. Still, 



^ Large astrophysically-relevant objects like planets tend to be very nearly spherical due to the 
limited shear stresses which can be supported. The trace-free components of the moments, which 
are all that couple to the motion, are then much smaller than ml ^ . These tend to be induced mainly 
by rotation and external tidal fields, and are typically modeled using Love numbers. 



350 



A.L Harte 



self-forces are “almost ignorable.” They effectively renormalize a body’s momen- 
tum, but do nothing else.^ This is a finite renormalization, meaning only that self- 
forces conspire to, e.g., make the mass computed by integrating over a body’s stress- 
energy tensor differ from the mass inferred by watching how that body accelerates 
in response to external fields. 

Physically, renormalization arises because as a charge accelerates, its field must 
be accelerated as well. Although portions of that field may break away as radiation 
or otherwise change, there is a sense in which charges and their fields remain insep- 
arably coupled. The energy contained in a body’s self-field implies that it must resist 
acceleration and contribute to that body’s inertia. 

Now, self-forces vanish in Newtonian theory because of Newton’s third law. The 
self-field is sourced by a body’s instantaneous mass distribution and exerts forces 
on that same mass distribution. Interactions are no longer instantaneous, however, 
in theories which involve hyperbolic field equations. Fields are sourced by charge 
in a four-dimensional region of spacetime, but act only on configurations in three- 
dimensional hypersurfaces. It is impossible to maintain an exact concept of “action- 
reaction pairs” in this context. The imbalance which results turns out to exert forces 
and torques which precisely mimic changes to a body’s linear and angular momenta. 
This type of effect is generic for any coupling to long-range fields which satisfies 
hyperbolic field equations (or otherwise depends on a system’s history). 



3.1 Relativistic Continuum Mechanics 

The simplest relativistic modification of Newtonian gravity^^ involves objects with 
scalar charge density p interacting via a field 0 which satisfies the wave equation (73). 
Suppose that the body of interest resides inside a worldtube W d M containing no 
other matter. Also assume that 2U is spatially bounded in the sense that its spacelike 
slices have finite volume. 

Our description for the motion of a compact object is based on its stress-energy 
tensor This encodes many of a body’s mechanical properties, and is analogous 
to the (p, triplet used to analyze Newtonian objects in Sect. 2. Like those 

variables, the stress-energy satisfies differential equations which are independent 
of the specific type of material under consideration. Although these laws do not 
determine completely, they do provide significant constraints. 

If 0 vanishes everywhere and there are no other long-range fields, = 0. 

More generally, scalar fields contribute to a system’s total stress-energy. It is only 



^The notion of self-force used here is consistent with the usual Newtonian definition, but is uncon- 
ventional in relativistic contexts. Its precise meaning is made clear below. 

^®This is to be considered as a model problem. If interpreted as a theory of gravity, the type of scalar 
field theory described here is not compatible with observations. Of course, it is not necessary to 
interpret (/) as a gravitational potential (so p needn’t be a “mass density” in any sense). 
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this total 7 ’jpI’ = which is necessarily** conserved: 



= 0 . 



(74) 



Consider splitting this total into “body” and “field” components: 



’-rcib 'T'Cib I 'T'Cib 

^tot — Afield ^body 



(75) 



Away from any matter, it is clear that = 0 and 



'T'Clb 

^fie*d - ^ 



v«0vV - -g^\v,4>v^4> + 



(76) 



Elsewhere, local interactions between the matter and the field make it physically 
difficult to motivate any particular split. 

One possible way forward is to work only with . Unfortunately, the momen- 
tum obtained from this stress-energy tensor might be very different if computed 
first in a slice of 2H, and then in a slightly larger hypersurface. There is no natural 
boundary where integrations can be stopped. Although momentum integrals might 
settle down when performed over very large volumes, it is unclear how useful this 
is. The relevant distance scale could be so large that the only “total momenta” which 
are useful encompass the entire (modeled) universe, precluding any ability to learn 
about the dynamics of individual masses. Results based on alone can be useful 
in certain approximations involving the motion of very small bodies [28], but this is 
insufficiently general for our purposes. 

The approach adopted here is mathematically the simplest. Let be given 
by (76) throughout W. The remaining stress-energy tensor is then defined to be the 
body’s: := Equations (73)-(76) imply that this satisfies 

VfcTtody = -PV>, (77) 

which generalizes the Newtonian conservation laws (28) and (29). 



3.2 Generalized Momentum 

Recall the generalized momentum (35) defined for Newtonian mass distributions. 
This requires very little modification for use in relativistic systems. The one com- 
plication which does arise is that there is no longer any preferred notion of time. A 
time parameter must be supplied as an additional ingredient, which is accomplished 



a Lagrangian formalism, the total stress-energy tensor considered here is derived from a func- 
tional derivative of the action with respect to the metric. It is conserved whenever the action is 
diffeomorphism-invariant [15]. 
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by foliating W with a 1 -parameter family of hypersurfaces {^ 5 }. Each 53^ may be 
viewed as the body region at time 5 ', and is assumed to have finite volume. The pre- 
cise nature of the body regions may be considered arbitrary for now. They can be 
spacelike or even null.^^ 

Supposing that a particular foliation has been fixed, the generalized momentum 
contained in any three-dimensional region c 53^ is most obviously defined as 



where is any Killing vector field. Pg [93] (•) defines a linear map on the space of K of 

Killing vector fields. It is therefore a vector in the dual space K*. For the maximally- 
symmetric four-dimensional spacetimes considered here, dim K = dim K* = 10. 
Given a particular event, four of these dimensions correspond to translations and six 
to rotations and boosts. As in the Newtonian case, such decompositions allow the 
generalized momentum to be expressed in a basis which recovers linear and angular 
momenta associated with a preferred event. The details of this correspondence are 
described more precisely in Sect. 3.7. 



3.3 Generalized Force 

Forces and torques are determined by 5 ' -derivatives of the generalized momentum. 
Considering only the momenta in 93^ , it is convenient to simplify the notation by 
defining Ps = /^ 5 [ 935 ]. Using (77) together with Killing’s equation then shows that 



for all e K, where dS := t^dSa and denotes a time evolution vector field for 
the foliation {93^}. The relativistic generalized force (79) is essentially identical to 
its Newtonian analog (36). As in that context, the force can be immediately put into a 
practical form only if the object of interest does not significantly contribute to 0. More 
generally, the self-field introduces considerable complications. Progress is made by 
finding a precise definition for the self-field, computing its effects analytically, and 
then subtracting it out. The “effective field” which remains after this process is 
typically much simpler to analyze than the physical one. 



^^Consider, e.g., the past-directed light cones associated with a timelike worldline. 




(78) 




(79) 
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3.4 The Self-field 



At least in part, the Newtonian self-field (19) can be generalized essentially as-is. 
Let the relativistic self-field 0s be obtained by convolving an object’s charge density 
with a particular two-point^^ scalar G. This must be a Green function, so 

(V" Va - i?)G{x, x') = Att5{x, x'). (80) 

Still more constraints are necessary to fix G uniquely. One of these follows from 
requiring that the self-field^"^ depend only quasi-locally on a body’s “instantaneous” 
configuration. It should not, for example, involve distantly-imposed boundary con- 
ditions, the behavior of other objects, or a body’s history in the distant past. Such 
conditions can be ensured by demanding that G(x,x') = 0 whenever v and x' are 
timelike- separated. Lastly, suppose that G(x,x') = G(x',x). Such objects exist 
(at least in finite regions), are unique, and are referred to as S-type or “singular” 
Detweiler- Whiting Green functions [10, 29]. In the maximally- symmetric back- 
grounds considered here, G satisfies C^G = 0 for all g It implies a rela- 
tivistic form of Newton’s third law. For massless fields in Minkowski spacetime, 
G=^(G+-hG_) where G± are the advanced and retarded Green functions. More 
generally. 



G= ^(G+ + G_-y) (81) 

for a certain symmetric biscalar V(x,x') which satisfies the homogeneous field 
equation. G can also be expressed in terms of Synge’s function a. Using A to denote 
the van Vleck determinant [10] (which depends on second derivatives of cr), (5 the 
Dirac distribution, and © the Heaviside distribution, 

G = 1[a 1/2,5((7) - y0((7)]. (82) 

This shows that G(v, •) can have support on and outside the light cones of v. 

The self-field “due to” charge contained in a given spacetime volume 91 c 
can now be expressed in terms of the S-type Detweiler- Whiting Green function: 

0s[9f](x)= [ G(x,x')p(x')dV'. (83) 

Jm 

If the argument 93 is omitted, the integral is understood to be carried out over an 
object’s entire worldtube 9U. 



^^It is also possible to introduce (n -f l)-point self-fields similar to (44). This is not considered any 
further here. 

^"^The term self-field is used in several different ways in the literature. The definition adopted here 
is uncommon, and is sometimes described as the “Coulomb-like” component of the self-field. 
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We now compute the self-force. It simpler not to consider this directly, but rather 
its integral over a finite interval of time. Letting Sf > S{, it is clear from (79) that 



Pst iO - Ps,(0 = = -J^pCi4>dV 



(84) 



for any e K. The 4- volume T = T(5'i, c which appears here represents that 
part of an object’s worldtube which lies in between the initial and final hypersurfaces 
. See Fig. 1. Substitution of the self-field definition (83) into (84) shows that 
the total change in momentum due to 0s alone is 

[ dV [ dV'f(x,x'), (85) 

J3 Jmj 



where 



f(x, x') = -p(x)p(x')^'^(x)WaG(x, x') (86) 

may be interpreted as the density of generalized force exerted at x by Indepen- 
dently of any specific form for /, double integrals with the form (85) can be rewritten 
as 



1 

2 



[ dv( [ dV'[f(x, x') + fix', X)] + [ dV'[f(x, x') 
J3 \JW JW\3 



f(x',x)] 



) 



(87) 



whenever the relevant integrals commute. This identity is very general, and is cen- 
tral to understanding motion in every relativistic theory we discuss. It is therefore 
worthwhile to examine it in detail. 




Fig. 1 Schematic illustrations of a body’s worldtube 2IT together with hypersurfaces 03^. and 53^^ 
(drawn spacelike). The region X(si, Sf) C 22T bounded by these hypersurfaces and appearing in 
(84) is indicated. The shaded 4-volumes and ^sf [see (97)] denote the domains of dependence 
associated with the self-momenta and defined by (90). Although Pgf (0 — Psi iO depends 
on throughout 3(si , ^/), it depends on more complicated aspects of a body’s internal structure 
only in the shaded regions. These contributions are always confined to within approximately one 
light-crossing time of the bounding hypersurfaces, and are therefore “quasi-local” 
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The first term in (87) can be interpreted as an average of “action-reaction pairs” 
in the sense of Newton’s third law. It is very similar to the types of identities used 
to simplify the motion of objects coupled to elliptic fields in Sect. 2. Recalling that 
discussion, the reciprocity relation G(x, x') = G(x', x) implies that 

[ dV [ dV'[f(x,x')-\-f(x\x)] = - [ dV [ dV' p(x)p(x')C^G(x, x'). (88) 
J3 Jmj J3 Jmj 

As in the Newtonian theory, Lie derivatives of G are associated here with sums 
over action-reaction pairs. As in that case, these sums vanish. All of the self-force is 
therefore determined by the second group of terms in (87). These do not vanish in 
general. They are intrinsically connected to the finite speed of propagation associated 
with the wave equation, and are responsible for renormalizing a body’s momentum. 



3.5 Renormalization 



The generalized force exerted by 0s is entirely determined by the final part of (87). 
To understand this, first let (resp. denote the four-dimensional future (past) 
of inside the body’s worldtube: 

(^) := U (89) 

±(r— 5)>0 

Also define 

Es(0 ■■= \ . (90) 

Like Ps, this represents an 5' -dependent vector in K*. Using it, the second term in 
the expansion (87) for the self-force is simply 

1: [ dvf dV'[f(x,x') - f(x',x)] = -[E,,(0 ~ Es, (01 ( 91 ) 

2 J3 Jw\3 

Taking the limit ^ S{ while combining (84), (87), (88), and (91) finally shows 
that the generalized force can be written as 

^Ps(0 = - [ pEi^dS - ^EAO- (92) 

ds d^ 



Replacing the physical field 0 with cj) = cj) — cj)^ can therefore be accomplished only 
at the cost of the counterterm — d£’^/d^. That this is a total derivative suggests the 
introduction of an “effective generalized momentum” Ps satisfying 
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P,:=P, + E,. (93) 

For any finite scalar charge in a maximally-symmetric spacetime, 

^^Ps(0 = - (94) 

There is a sense in which Eg renormalizes the (bare) momentum Ps . The sum of Ps 
and Es behaves instantaneously as though it were the momentum of a test charge 
placed in the effective field <p. Furthermore, 

(V“v« - = 0. (95) 

Physically, it is not sufficient to motivate the renormalization Ps Ps merely by 
fact that the self-force is a total derivative. Essentially any function of one variable 
can be written as the total derivative of its integral. Indeed, one might introduce a 
constant sq and define 

PsiO ■= Ps(0 + EAO + r dr / dSpC^l (96) 

J SO J 

This does not vary at all with 5’. While it may be useful for some purposes, Ps is not 
a physically acceptable momentum. This is because it depends in an essential way 
on the configuration of the system for all times between sq and 5'. While Ps would 
be approximately local for 5' ~ it otherwise depends on a system’s history in a 
complicated way. 

The renormalized momentum Ps defined by (93) does not share this deficiency. 
Like Ps, it depends only on the body’s configuration in regions “near” 93^. The 
relevant region is, however, somewhat larger for Ps than it is for Ps. Definitions (83) 
and (90) imply that Es (0 depends on a neighborhood 93^ D 93^ defined to be the 
set of all points in which are null- or spacelike- separated to at least one point in 
93^. In terms of Synge’s function, 

= {v G 9U I a(x, y) > 0 for some y g 93^}. (97) 

^s is a (finite) four-dimensional region of spacetime. It extends into the past and 
future of 93^ by roughly the body’s light-crossing time. See Fig. 1. 

One might have guessed that a self-momentum at time 5' could be defined by 
integrating the stress-energy tensor associated with^^ 0s over a large hypersurface 
which contains 93^. Unfortunately, such integrals depend on gradients of 0s far 
outside of 93^. These, in turn, depend on the body’s state in the distant past and 
future. Such a definition is physically unacceptable in general. Nevertheless, it does 



^^Recalling (76), is quadratic in 0. The stress-energy tensor “associated with 0s” is taken to 
mean that portion of which is quadratic in 0s . Terms linear in 0s are not included. 
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make sense in the stationary limit, and may be shown to coincide with Es in that 
case [2]. In more dynamical cases, the Es defined here appears to be the only well- 
motivated possibility. 



3.6 Multipole Expansions 

Forces and torques exerted on relativistic scalar charges may be expanded exactly as 
in the Newtonian theory. Assuming that <j) can be accurately approximated using a 
Taylor series about some origin c 53^, the techniques of Sect. 2.2.5 may be used 
to show that (94) admits the multipole expansion 



d ^ 1 

n=Q 

The 2^^-pole moment of p which appears here is defined by 

[ a^^(x,Zs)--(T^^(x,Zs)p(x)dS, (99) 

and (f),ai---an denotes the ^th tensor extension of (p. Equation (98) may be compared 
with the Newtonian generalized force (51). Unlike its Newtonian counterpart, how- 
ever, the relativistic scalar monopole moment q may depend on time; the total charge 
is not necessarily conserved. Also note that the relativistic multipole expansion is 
intended only to be asymptotic. It may require truncation at large n (see, e.g., [30]). 



3. 7 Linear and Angular Momenta 

Like Ps, the effective generalized momentum Ps is an element of ^*. Expanding 
this in an appropriate basis recovers objects which may be interpreted as a body’s 
linear and angular momenta. The appropriate arguments are almost identical to those 
described in Sect. 2.2.6. 

Choosing a point every Killing field may be written as a linear combi- 
nation of 1- and 2-forms at [cf. (53)]. PsiO and A(0 clearly linear in so 

they too may be expanded in linear combinations of 1- and 2-forms at z^. Recalling 
(57), the coefficients in this combination may be identified as a body’s linear and 
angular momentum: 



1 



PsiO = P‘'{Zs,s)^a{Zs) + 



( 100 ) 
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Hats have been placed on and to emphasize that these are renormalized 
momenta. Bare quantities defined in terms of Ps may be introduced as well and 
shown to coincide with the momenta introduced by Dixon [6, 17, 24] for objects 
without electromagnetic charge-currents. The bare momenta obey more complicated 
laws of motion, and are not considered any further. 

Differentiating (100) while using (61) leads to an implicit evolution equation for 
p<^ and 5"'^: 



aabc-d 






d 

ds 



( 101 ) 



Varying over all and all = ^[a^b] recovers the explicit equations 



d^ 2 



d^ 



= + AT"'’. 



ab 



( 102 ) 



The force and torque appearing here may be found in integral form 

by comparing (94) and (101). Using the multipole expansion (98) instead. 



oo ^ 
n\ 

n=0 



" n\ 



b]dv-dn, 



\cd\ •••dfi 



n=0 



(103) 

(104) 



The laws of motion (102)-(104) describe bulk features of essentially arbitrary self- 
interacting scalar charge distributions in maximally- symmetric backgrounds. As in 
the Newtonian case, all explicit dependence on is contained in the Mathisson- 
Papapetrou terms. These terms are kinematic, and may again be traced to the changing 
character of Killing fields evaluated at different points. 

All conservation laws discussed in the Newtonian context generalize immediately. 
If C^(f) = 0 for some particular Killing field 

Ps(ip) = p‘"^a + ^S“^'^a'>Pb = (constant). (105) 



Similarly, 



= 0. (106) 

These results are exact. They hold independently of any choices made for . They 
also apply to approximate momenta evolved via any consistent truncation of the 
multipole series. 
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Although the relativistic multipole expansions are structurally almost identical 
to their Newtonian counterparts, it is important to emphasize that the effective field 
is far more difficult to compute in relativistic contexts. In Newtonian gravity, 0 is 
simply the external potential and is easily computed given the instantaneous external 
mass distribution of the universe. The relativistic effective potential can, however, 
depend in complicated ways on boundary conditions, initial data, and past history. 
Using retarded boundary conditions, the relativistic <j) typically depends on p, and is 
therefore not interpretable as a purely external field. 

Another property of the relativistic theory is that the angular momentum has 
six independent components rather than three. Consider a unit timelike vector 
at . This acts like a local frame which may be used to decompose into two 
components. Given there exist and which satisfy 



^ab — ^abcd^ ^ ( 107 ) 

and UaS^ = UaJn^ = 0. Writing out explicitly in fiat spacetime in the limit 
of negligible self-interaction suggests that represents a body’s “ordinary” angu- 
lar momentum about Similarly, may be interpreted as the dipole moment 
of a body’s energy distribution. Relativistically, these are two aspects of the same 
physical structure. The split (S^, m^) is closely analogous to the decom- 

position Fab (Ea, Ba) of an electromagnetic field into its electric and magnetic 
components. 



3.8 Center of Mass 

Thus far, the foliation {93^} of 2U used to define the generalized momentum has 
been left unspecified. The collection of events {z^} used to perform the multipole 
expansion (98) has been arbitrary as well. This constitutes a considerable amount of 
freedom. 

One simplifying strategy is to first associate a hypersurface with each possible 
point in This could be accomplished by, e.g., defining 93^ to be the past- (or 
future-) null cone with vertex Zs- A timelike worldline parametrized by {z^} then 
results in a null foliation of Alternatively, a spacelike foliation may be chosen as 
described in 117, 24, 31]. 

Regardless, defining each 93^ in terms of z^ reduces all freedom in the law of 
motion to the choice of a single worldline (and its parametrization). Recall that in 
Newtonian gravity, a body’s center of mass is the location about which its mass dipole 
moment vanishes. Relativistically, the dipole moment of a body’s stress-energy tensor 
is proportional to S^^{zs, s). In general, there is no choice of z^ which can be used 
to make this vanish entirely. It is, however, possible to use translations to set = 0 
as defined in (107). This requires the introduction of a frame with which to choose 
an appropriate dipole moment. Consider the zero-momentum frame where 
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p‘^=mu‘^. (108) 

m'* is defined to be a unit vector, so the rest mass m must satisfy 

m := ^/-p^^Pa- (109) 

A center of mass 7 ^ may now be defined by demanding that 

S‘^\l.,s)pai^,,s) = 0. (110) 



This can be interpreted as requiring that the dipole moment of a body’s energy 
distribution vanish as seen by a zero-momentum observer at 7 ^ . It is a highly implicit 
definition. Good existence and uniqueness results are known for the closely-related 
Dixon momenta [32, 33], but not more generally. We nevertheless assume that a 
unique worldline (and associated foliation) can be found in this way. Other choices 
are also possible, however. 

A general relation between the center of mass 4-velocity and the linear momentum 
may be found by differentiating (110). The result of this differentiation is solved 
explicitly for 7 ^ in [17, 31], resulting in^^ 



W7« = - N\u^ - 



sob _ 1 



m2 + 



(Ill) 



pqrs 



This assumes that the parameter 5 ' has been chosen such that 7 ^ pa = —m, and also 
that all instances of have been replaced with 7 ^ . In principle, it is possible for 
the denominator -h Rpqps here to vanish, indicating a breakdown of the 

center of mass condition. This can occur only if the curvature scale is comparable 
to a body’s own size, in which case it is unlikely that any simple description of 
an extended body in terms of its center of mass is likely to be useful. In more 
typical cases, it is straightforward to obtain multipole approximations of ( 111 ) by 
substituting appropriately-truncated versions of (103) and (104). 

Also note that 7 ^ is not necessarily collinear with p^. The difference p^ — 
may be interpreted as a “hidden mechanical momentum.” Simple examples of hidden 
momentum are commonly discussed in electromagnetic problems (see, e.g., [ 8 , 34, 
35]), but occur much more generally. Some consequences of the hidden momentum 
are discussed in [36, 37]. 

The center of mass condition provides more than just a relation between the 
momentum and the velocity. It also implies that , s) can be written entirely in 

terms of the spin vector s). Inverting (107) while using (110), 



^^References [17, 31] derive the momentum- velocity relation using Dixon’s momenta without a 
scalar field, but in a spacetime which is not maximally- symmetric. Here, Dixon’s momenta are 
modified by Eg, there is a scalar field, and the spacetime is maximally- symmetric. Despite these 
differences, the relevant tensor manipulations are identical. 
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Sails, s) = -^-eabcdU^S‘=‘^. (H2) 

Differentiating this and applying (102), the spin vector evaluated about the center of 
mass is seen to satisfy 



^ = -\^abcdu'’N^‘^ +Ua[sb^^. (113) 

The first term here represents a torque in the ordinary sense. The second term is 
responsible for the Thomas precession, and may be made more explicit by substitut- 
ing ( 102 ) and (108). 

An evolution equation may also be derived for the mass m , which is not necessarily 
constant. Variations in m are not an exotic effect; masses change even for monopole 
test bodies coupled to relativistic scalar fields. In general, use of (102) and (109) 
shows that the mass evaluated using = 7 ^ satisfies [24] 



dm 

ds 



- 

-7sFa + NatU^ — . 



(114) 



The final term here may be made more explicit by using (102) and (108) to eliminate 

DwVd^. 



3.9 Monopole Approximation 



The equations derived here are quite complicated in general. Some intuition for them 
may be gained by truncating the multipole series at monopole order. Inspection of 
(103) and (104) then shows that 

Fa = -qVa^, Nat = 0. (115) 

Further restricting to cases where q = (constant), it follows from (114) that 

m — q^ = (constant). (116) 

m — qtj) may therefore be viewed as a conserved energy for the system. Note that it 
is the effective field 0 which occurs here, not the physical field 

Contracting(113) with 5^ also shows that 5^ := S^Sa = (constant), meaning that 
the spin vector can only process in the monopole approximation. The rate at which 
this occurs may be simplified by first recalling that in any maximally- symmetric 
spacetime, there exists a constant k such that 



Rabcd = I^ga[c9d]b- 



( 117 ) 
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Of course, /^ = 0 in the flat background of special relativity. Regardless, the spin 
evolution is independent of 



= -{qlm)UaS^Vbl (118) 

It experiences a purely Thomas-like precession. The momentum-velocity relation 
(111) does, by contrast, retain explicit evidence of the curvature, reducing to 

mYs =p'^ + { . \ '^^‘=‘^UbScVdl ( 119 ) 

\1 + ^K{S/my J 

The linear momentum is also affected by k: 



ds 



-qVa(p + q 



/ \K(S/m)^ \ / 

y 1 + jK(S/my 



+ UaU>’ - SaS^ 



/s^) ^bl 



( 120 ) 



The overall force is therefore a particular linear transformation of —qVa^. Up to 
an overall factor, the second term here extracts that component of qVa^ which is 
orthogonal to both and . 

Together, (116) and (118)-(120) determine the evolution of a scalar charge in 
the monopole approximation. Despite the approximations which have already been 
made, these equations remain rather formidable. They may be simplified further by 
demanding that 5 = 0 at some initial time. The monopole approximation implies 
that an initially non- spinning particle remains non- spinning, so it is consistent to set 
5 = 0 for all time. It also follows that and 

7 “ = -{q/m){g^‘’ + ( 121 ) 



In the test body limit where (j) ^ (j), this is the typical equation adopted for the motion 
of a point particle with scalar charge q. A point particle limit of this equation which 
still allows for self-interaction is equivalent to what is known as the Detweiler- 
Whiting regularization [10, 29]. This regularization — which originally arose via 
heuristic arguments associated with the singularity structure of point particle fields — 
is a special case of the much more general results derived here (and which first 
appeared in [2]). Its origin is unrelated to any point particle limits or to the singularities 
associated with them. 

It is shown in Sect. 4.2 that standard self-force results follow easily from (121). 
That section also generalizes all laws of motion derived here to apply to scalar charges 
moving in arbitrarily-curved spacetimes. 
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4 Motion in Curved Spacetimes 

The discussion up to this point has made extensive use of Killing vector fields. 
This is familiar and simple, but not necessary. The first step towards understanding 
motion in generic spacetimes is to find a suitable replacement for the space of Killing 
vector fields. Once this is accomplished, the problem of motion for extended charges 
coupled to scalar fields is considered once again. This example is used to illustrate 
a new type of renormalization which occurs in spacetimes without symmetries. The 
techniques used to solve the scalar problem are then adapted to discuss motion in 
electromagnetic fields. Lastly, we consider motion in general relativity, where the 
objects of interest dynamically modify the geometry itself. 



4.1 Generalized Killing Fields 

Recalling (35) or its relativistic analog (78), the generalized momenta used in 
maximally- symmetric spacetimes are defined as linear operators over the space K 
of Killing vector fields. There is, however, no obstacle to replacing K by some other 
vector space Kq. This is the approach we take to defining momenta in generic space- 
times. Although several notions of generalized or approximate Killing fields exist in 
the literature [38-40], only one of these [1] appears to be suitable for our purposes. 
We describe it now. 

The space of generalized Killing fields used here can be motivated axiomatically. 
First note that avoiding significant modifications to the formalism developed thus 
far requires that all elements of Kg be vector fields on spacetime (or space in non- 
relativistic problems). It is also reasonable to require that: 

1 . All genuine Killing vectors which might exist are also generalized Killing vectors : 
K c Kg. 

2. K = Kg in maximally- symmetric spacetimes. 

3. The dimensionality of Kg can depend only on the number of spacetime dimen- 
sions. 

The first of these conditions is clearly necessary for any Kg which may be said to 
generalize K. The second condition ensures that there are not “too many” generalized 
Killing fields in simple cases. The final condition is more subtle. It guarantees that 
the generalized momentum contains the “same amount” of information regardless 
of the metric. More specifically, a generalized momentum defined using Kq must 
be decomposable in terms of linear and angular momenta with the correct number 
of components. Recalling (34), 

1 

dim Kg = -N{N + 1) > dim K (122) 



in any A -dimensional space. 
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The given conditions restrict Kq, but do not define it. An additional constraint 
is needed which describes how those elements of Kg which are not also elements 
of K preserve an appropriate geometric structure. It is not, of course, possible to 
demand that they preserve the metric. Symmetries of the connection or curvature are 
unsuitable as well. The only reasonable possibilities are nonlocal. 

4.1.1 Symmetries About a Point 

First consider the Riemannian case.^^ Recalling (50), any Killing vector field used to 
drag two points v and x' preserves the separation vector — cr^(v, x') between those 
points. A slightly weaker condition can be used to define generalized Killing vectors 
even when no genuine Killing vectors exist. Suppose that a particular point v has been 
fixed and demand that Ag(v) be defined as the set of all vector fields satisfying 



The result clearly forms a vector space. Unfortunately Ag (x) is too large. In fiat space, 
it becomes independent of v and coincides with the space of affine collineations: vec- 
tor fields satisfying Va^^Qbc = 0- Geometrically, affine collineations represent sym- 
metries which preserve the connection. In generic spaces, Ag(x) may be described 
as a space of generalized affine collineations with respect to x. 

The space of Killing vector fields is known to be a vector subspace of the affine 
collineations. Similarly, generalized Killing fields Kg(x) may be obtained as an 
appropriate subspace of Ag(x). It is sufficient to demand only that the appropriate 
vector fields be exactly Killing at x : 



The set of all vector fields satisfying this and (123) are denoted by Kg(x). They are 
the generalized Killing fields with respect to x. Any genuine Killing fields which 
may exist are also elements of Kg(x). 

Many geometric structures are preserved by generalized Killing fields. Equation 
(124) can be shown to generalize to 



C^a^(x\x) = 0. 



(123) 






(124) 



jC^^gabM = ^ajC^^gbcM = 0 , 



(125) 



^^The same geometric conditions can also be imposed in Lorentzian geometries. Physically, how- 
ever, the vector fields discussed here are most useful in non-relativistic contexts. A more complicated 
structure described in Sect. 4.1.2 is better-suited to Lorentzian physics. 
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which holds only at the special point x . More generally, “projected Killing equations” 
are valid wherever the generalized Killing fields are defined [1]: 

cr'^'(x, x')C^ga'b'(.x') = x')a^\x, x')VaiC^gb'c'{x') = 0 . (126) 

The elements of Kg{x) also preserve all distances from v in the sense that 
£^cr(v, x') = 0. In general, (vO is a solution to the geodesic deviation (or Jacobi) 
equation along the geodesic connecting x and x' . 

The generalized Killing fields may be shown to admit an expansion in terms of 1- 
and 2-forms at x. Expanding (123) in terms of covariant derivatives shows that for 
every g Kg(x), 



(127) 

The bitensors and which appear here — known as Jacobi propagators — 
are explicitly given by (56). The expansion (53) of Killing vector fields is therefore 
identical to the expansion (127) of generalized Killing vector fields. Varying and 
Va(,b arbitrarily, it is clear that dim Kq{x) = N -\- \N{N — 1) = \N{N -h 1) in V 
spatial dimensions. 

The generalized Killing fields defined by (123) and (124) provide a notion of 
symmetry with respect to a point. They may be used to analyze non-relativistic 
motion in geometries which do not admit any exact symmetries. This is not pursued 
here. We instead focus on relativistic motion, in which case it is more appropriate 
to consider a different kind of generalized Killing field which provides a notion of 
symmetry near a worldline instead of a point. 



4.1.2 Symmetries About a Worldline 

In relativistic contexts, it is useful to define a Kg which takes as arguments a timelike 
worldline and a foliation instead than a single point. Given a worldtube = {03^ | 
5 ' G M} in a Lorentzian spacetime (AJ, gab) of dimension V, consider a foliation 
{^^}. Also consider a timelike worldline Z parametrized by := ^053^. The 
definition of Kg(Z, {53^}) in this context is as follows: First, (125) is enforced 
for all z^. This provides a sense in which the elements of Kq{Z, {^ 5 }) generalize 
Poincare symmetries “near” Z. It implies that the generalized Killing fields and their 
first derivatives satisfy the Killing transport equations on Z. Moreover, 

^a^biAz = -Rbc/^d, (128) 

which generalizes (61). This describes, e.g., how generalized Killing fields which 
might appear purely rotational or boost-like at one point acquire translational compo- 
nents at nearby points. When applied to problems of motion, it leads to the Mathisson- 
Papapetrou spin-curvature force. 
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Demanding that (125) hold on Z describes the generalized Killing fields only on 
that worldline. They may be extended outwards into W by demanding that 

jC^a^\x',Zs) = 0 (129) 

for each x' e 93 ^. This is merely a restriction of (123). It implies that (127) 
holds whenever there exists some 5 * such that x = Zs and x' e 93^. Elements of 
Kg(Z, { 93 ^}) may therefore be specified using an arbitrary pair of 1- and 2-forms at 
any point on Z. As required, the generalized Killing fields form a vector space with 
dimension ^N(N 1). Additionally, C^a(x\ Zs) = 0 and 

a“'{x\ Zs)cr^'{x', Zs)^^ga'b'ix\ Zs) = 0 ( 130 ) 

whenever x' e 93 ^ [1]. The elements of Kq(Z, { 93 ^}) are the generalized Killing 
fields used in the remainder of this work. 



4.2 Scalar Charges in Curved Spacetimes 

We now return to the motion of scalar charges as discussed in Sect. 3, but no longer 
require that the background spacetime admit any symmetries. Consider a body cou- 
pled to a Klein-Gordon field 0 in a four-dimensional spacetime (7W, gab)- This body 
is assumed to be contained inside a worldtube W d M with finite spatial extent and 
no other matter. Its stress-energy tensor is assumed to satisfy (77). 

A generalized momentum is easily defined by reusing (78), but with the space 
K employed there replaced by an appropriate space Kg(Z, {93^}) of generalized 
Killing fields as described in Sect. 4.1. This requires the introduction of a timelike 
worldline Z and a foliation {93^} of Supposing that these structures have been 
chosen — perhaps using center of mass conditions — let 




for all G Kg(Z, {93^}). For each i*, this is a vector in the ten-dimensional space 
Kq{Z, {93^}). The associated linear and angular momenta pa and Sab coincide with 
those introduced by Dixon [6, 17, 24] for matter which does not couple to an elec- 
tromagnetic field. 

Using stress-energy conservation to differentiate the generalized momentum with 
respect to the time parameter 5', 



^^Dixon’s papers never considered matter coupled to scalar fields. The momenta associated with 
(131) are those which arise naturally for objects falling freely in curved spacetimes. 
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The first term on the right-hand side of this expression is not present in its maximally- 
symmetric counterpart (79); extra forces arise when the are not Killing. These 
may be interpreted as gravitational effects. While sufficiently small test bodies fall 
along geodesics in curved spacetimes, the same is not true for more extended masses. 
Gravity then exerts nonzero 4-forces and 4-torques which are described by the C^Qab 
term in (132). If expanded in a multipole series, (125) implies that such effects first 
appear at quadrupole order. This is described more fully in Sect. 4.2.2. 

The effect of the scalar field on the second term on the right-hand side of (132) 
may be understood by repeating the arguments of Sects. 3.4 and 3.5. This results in 
Eq. (94) being replaced by 



ds ./% L 



L 



- p(x)jC^$(x) 



1 f dV'p(x)p(x')C^G(x.x') 

2 Jw 



d5. 



(133) 



The effective field which appears here is defined by 0 := 0 — 0s, where 0s satisfies 
(83) and G is the Detweiler- Whiting S-type Green function described in Sect. 3.4. The 
generalized momentum has also been replaced by the renormalized momentum 
Ps := Ps Es, where the self-momentum Eg is given by (90). 

An absence of spacetime symmetries explicitly affects self-interaction only via the 
second line of (133). Unlike in maximally- symmetric backgrounds, the Detweiler- 
Whiting Green function does not satisfy C^G = 0 in general. Indeed, no Green 
function can be constructed with this property, a result which could be viewed as 
implying that it is impossible to define an analog of Newton’s third law in generic 
spacetimes. It follows that the self-force cannot be entirely absorbed into a redefi- 
nition of the momentum. Understanding it requires another type of renormalization 
which affects the quadrupole and higher multipole moments of 



4.2.1 Breakdown of Newton’s Third Law 

The self-force which remains after renormalizing Pg depends on C^G. It may be 
understood physically by recalling that there is a sense in which G is constructed 
purely from the spacetime metric. It therefore follows that for any vector field 
whether it is a generalized Killing field or not, C^G must depend linearly on C^Qab- 
In this sense, the first and third terms on the right-hand side of (133) are both linear 
in C^Qab- They are physically very similar, both representing different aspects of the 
gravitational force [4] . 

As remarked in Sect. 3, a body’s inertia depends on both its own stress-energy 
tensor and the properties of its self-field. The inertia due to is described by 
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Ps and the inertia due to 0s by A body’s passive gravitational mass experiences 
a similar split. The gravitational force exerted on a body due to its stress-energy 
tensor is 




while the gravitational force exerted on a body’s self-field is instead described by 

-If dS [ dV'ppC^G. (135) 

Although it is difficult to do so explicitly, this latter expression can be transformed 
into a linear operator on C^Pab- It effectively adds to a body’s gravitational mass 
distribution as inferred by observing responses to different spacetime curvatures. In a 
multipole expansion, the force (135) may be viewed as renormalizing the quadrupole 
and higher multipole moments of a body’s stress-energy tensor. 

The failure of Newton’s third law provides a second mechanism by which self- 
fields induce renormalizations. It is distinct — both in its origin and in the quantities 
it affects — from the mechanism described in Sect. 3.5. The renormalization of a 
body’s momentum was shown to be closely connected to the hyperbolicity of the 
underlying field equation. The geometry-induced^^ failure of Newton’s third law can 
instead arise even for matter coupled to elliptic field equations. It affects only the 
quadrupole and higher moments of a body’s stress-energy tensor. Combined, the two 
types of renormalization affect all multipole moments of In this sense, one is 
led to the concept of an effective stress-energy tensor. This is defined quasi-locally, 
and can be identified with only in special cases. 

A fully explicit discussion of this effect is not known. To motivate it more directly, 
it is instructive to find a wave equation satisfied hy C^G. Noting that 

£^<5(x, x') = x')g^^{x)L^gab{x), (136) 

a Lie derivative of (80) yields 

(V" - P)C^G = 

lj2 

+ ^{g^^C^gab)G. (137) 



^^This type of renormalization fundamentally arises from the connection between C^G and G^gab 
which occurs for Green functions associated with the Klein-Gordon equation. In different theories, 
Lie derivatives of G can depend on fields other than the metric. Self-forces renormalize whichever 
moments are coupled to these fields. 
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Viewing C^G on the left-hand side of this equation as “independent” of the G appear- 
ing on the right-hand side suggests that is a solution to a wave equation sourced 
by C^Qab and its first derivative. A source which is independent of G may be found 
by applying the Klein-Gordon operator to both sides: 

- 2n i/{g“^C^gab)S. 

(138) 

This describes C^G as the solution to a fourth-order distributional differential equa- 
tion sourced by C^Qab and ^a^^dbc- 

Results like these may be used together with the field equation to integrate (135) 
by parts. Let 



rpab 

^ field, S 



1 

47T 



vVsvVs - + g-^4) 



(139) 



be the stress-energy tensor associated with 0s and define 



+ 0 / /3'(VVs£«G - 4>s^‘^C^G)dV. 

ott J2U 

The law of motion (133) then reduces to 



II 


r r 1 

/ /'T’Gb 1 n^ab \ G 

i 2 ^^t>ody ^ field, 




- 0 / I“dSa 



dS 



Tt^ASab. 



d^s 



(140) 



(141) 



The gravitational force in this expression clearly acts on the combined stress-energy 
tensor s • Unfortunately, the stress-energy tensor associated with the self- 

field does not have compact spatial support. The finite integration volume 93^ must 
therefore be compensated by the two boundary terms involving . If these can be 
ignored, it is evident that gravitational forces are determined only by ^fieM s • 

This cannot, however, be expected to hold generically. In general, there does not 
appear to be any reason to neglect X ^ . 

Approximations may instead be introduced which allow the renormalized mul- 
tipole moments to be computed essentially using local Taylor series [4]. Another 
approach, described here for the first time, is to use Hadamard series. Recalling the 
Hadamard form (82) for G, 
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— HC^(j)5{(j) — £^y0(cr) . 



(142) 



Our task is now to convert this into an expression where all Lie derivatives act on 
the metric. 

This is easily accomplished for the Lie derivatives of a and A. Differentiating the 
well-known identity cr^ = 2cr shows that 



(J^ Va'C^CF — C^CF = 



^(J (7 L^^Qa'b' ' 



(143) 



The differential operator {x, x')V a' appearing here is a covariant derivative along 

the geodesic connecting x and x' , so (143) may be viewed as an ordinary differential 
equation for C^cr. Letting jr denote a geodesic which is affinely parametrized by r 
while satisfying yo = -^ and y\ = x^ it follows that 



1 

£^a(x,x'} = - y^y^,C^gab(yr)dT. (144) 

2 Jo 

Moreover, an argument found in [4] shows that Lie derivatives of the van Vleck 
determinant A depend on 

In a‘/2 = -1 [g^^C^gab + g^'”' C^Qa'b’ + 27/'"'' V« . (145) 



Both C^a and A^/^ may therefore be written as line integrals — solutions to trans- 

port equations — which are linear in C^Qab- Substituting these expressions into (142) 
goes much of the way towards expressing C^G in terms of C^Qab- 

All that remains is to consider C^V . This is more difficult. Even V itself is com- 
plicated to compute. It is a solution to the homogeneous field equation which is 
symmetric in its arguments and satisfies 



V(x, x') = G+(x, x') -h G-(x, x') 



(146) 



whenever x' is timelike-separated from x. Alternatively, V can be computed using 
a transport equation along null geodesics [10]. For each x\ this transport equation 
may be used as boundary data with which to obtain V(-, xO- Extending this data 
outside of the null cones is essentially an “exterior characteristic problem:” One 
seeks a solution to a hyperbolic differential equation in the exterior of a null cone 
given values of the solution on that cone. Unlike interior characteristic problems, the 
general mathematical status of such problems is unclear. 

One way to proceed is to construct a Hadamard series. This is an ansatz which 
supposes that V can be expanded via [41] 
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00 

V(x, X^) = Vn(x, x^)a^(x, x^). 
n=0 



(147) 



The Vn here are determined by demanding that each explicit power of a vanish 
separately when this series is inserted into (V^V^ — /i^)V = 0. The result is an 
infinite tower of ordinary differential equations. The ^ = 0 case is governed by 

Ai/VVa(A-i/2vo) + Vo = 1 v"V«a1/2. (148) 

For each x', this determines V (•, x') on the light cones of x' . Higher-order terms are 
needed in the exteriors of these light cones. For all ^ > 1, 

A'/V'-V^CA-i/^y^) + („ + i)y„ = _Tv"V«y„_i. (149) 

2n 

It should be emphasized that the Hadamard series is not a Taylor expansion. The Vn 
are two-point scalar fields, not constants. Furthermore, the Hadamard series is known 
to converge only if the metric is analytic (and even then, it might converge only in a 
finite region) [41]. Although analyticity is quite a strong assumption, there may be 
other interesting cases where a finite Hadamard series can be used to approximate V 
up to a well-controlled remainder. 

Assuming that (147) is valid, it is easily used to compute Lie derivatives of V. 
This results in 

oo 

+ \)Vn+iCi:aY. (150) 



Lie derivatives of a may already be transformed into Lie derivatives of gab via (144). 
Lie derivatives of the Vn may instead be found by differentiating (148) and (149). 
The n = 0 case satisfies 

A*/V“Va(A-‘/2£^yo) + £^Vo = _ VoCYa) 

+ (151) 

for example. The left-hand side of this equation may be interpreted as an ordinary 
differential operator along the geodesic which connects the two arguments of Vq. 
All Lie derivatives in the source on the right-hand of this equation may, with the 
help of (144) and (145), be rewritten in terms of C^Qab- H follows that C^Vq can be 
expressed as a line integral involving C^Qab- Similar results also hold for C^Vn with 
n > 0. The detailed forms of these integrals are complicated and are not displayed 
here. The important point, however, is that all parts of C^G may be expressed as line 
integrals involving C^Qab- Changing variables appropriately and using the spatially- 
compact support of p then shows that the self-force (135) does indeed renormalize 
the gravitational force (134). 



372 



A.L Harte 



4.2.2 Multipole Expansions 

It is useful to expand the scalar force in a multipole series when 0 varies slowly. 
Similarly, the gravitational force may be expanded in its own multipole series when 
there is an appropriate sense in which gab does not vary too rapidly^^ inside each 
53^. Such expansions can be obtained using the techniques of Sects. 2.2.5 and 3.6. 
Recalling (94), (98), and (133), first note that 

d 1 1 /* 

n=Q ' 

- - / dS j dV'pp'C^G. (152) 

The two integrals which remain here are intrinsically gravitational. 

Given (125), it is evident that multipole expansion for the generalized force must 
begin at quadrupole order. More specifically, it may be shown that [4, 6, 17] 

If 1 1 

2 ^ ^ ^^9ab,cv-Cn^ (153) 

^ n=2^' 



where gab,ci-cn represents the ^th tensor extension of gab and i^^-^nab 2" -pole 

moment of Tensor extensions in this context are somewhat more complicated 
than in the scalar case discussed in Sect. 2.2.5. While they are still defined as those 
tensors which reduce to n partial derivatives when evaluated at the origin of a Riemann 
normal coordinate system, equations like (48) must be generalized for objects with 
nonzero tensorial rank (see, e.g., [4]). For this reason, explicit integrals relating 
jcv-cnab are significantly more complicated than their scalar analogs. These 

are not needed here, and may be found in [4,6]. They are Dixon’s multipole moments. 
Additionally, note that the first few tensor extensions of the metric are gab,c = 0 and 



9ab,c\C2 — ^^a{c\C 2 )b’> 9ab,c\C2C2 — {c\^\a\c 2 C'i)b’> 

_ 6 16 ^ 

9ab,c\C2C3C4 — ~^^{ciC2^\a\c3C4)b Y^^a(ciC2 ^|^|C3C4)J- 



(154) 

(155) 



Expanding both integrals in (152) while identifying coefficients in front of 
^^9ab,ci--cn results in a series structurally identical to (153), but with all bare mul- 
tipole moments j^^-^ncib j-eplaced by their renormalized counterparts 



^®More precisely, the coordinate components gtj must not vary too rapidly when expressed in 
a Riemann normal coordinate system with origin Physically, this is a significant restriction. It 
would be far better to perform multipole expansions only using effective metrics where gravitational 
self-fields have been appropriately removed. It is not known how to do to this for the full Einstein- 
Klein-Gordon system. 
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final multipole expansion for the generalized force acting on a self-interacting scalar 
charge distribution is therefore 

H 1^1 oo ^ 

= 2 Z - Z (156) 

n=2 n=0 

Gravitational terms first arise at quadrupole order, while scalar terms appear even at 
monopole order. 



4.2.3 Forces and Torques 

As in Sects. 2.2.6 and 3.7, the generalized momentum can be decomposed into linear 
and angular components These obey the law of motion (102), where the 

force and torque are now supplemented by gravitational terms at quadrupole and 
higher orders: 



= 9 Z - Z (157) 

z n\ n\ 

n=2 n=0 

00 ^ 
n=2 

oo ^ 

+ (158) 

n\ 

A center of mass may be defined exactly as described in Sect. 3.8. Applying (110), 
the momentum- velocity relation remains ( 111 ). 



4.2.4 Monopole Approximation 

Suppose that Z is identified with the center of mass worldline { 7 ^}. It is then inter- 
esting to consider the laws of motion truncated at monopole order. As explained in 
Sect. 3.9, the spin magnitude is conserved in such cases. It is therefore consistent to 
consider cases where Sa is negligible. Assuming this, an object’s mass m and center 
of mass 7 ^ satisfy 

m — = (constant), = —q(g^^ + 7 ? 7 ^ (159) 

whenever q = (constant). These are formally equivalent to the standard equations of 
motion adopted for scalar test charges, but with the physical field replaced everywhere 
by the effective field. Laws of motion applicable to self-interacting charges involve 
0 , not 0 . 
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Classical results on the scalar self-force are easily derived from (159). In the 
absence of any external charges (which does not necessarily imply trivial motion in 
curved spacetimes), it is natural to suppose that initial data for 0 has been prescribed in 
the distant past. If all details of that data have decayed sufficiently, the only remaining 
field is the retarded solution associated with the body’s own charge distribution. 
Generalizing this somewhat to also allow for a prescribed “external field” 0ext, 



While the retarded Green function G_ can be difficult to compute in nontrivial 
spacetimes, we assume that this problem has been solved. Using (81) and (83) then 
results in the effective potential^ ^ 



Although the “self-field” (ps has been removed from 0, remnants of the body’s 
charge distribution do remain. These are responsible for self-forces as they are com- 
monly defined in the literature, and give rise to physical phenomena such as radiation 
reaction. 

The “traditional” self-force problem involves point particle limits. Such limits 
consist of appropriate one-parameter families of charge distributions \p and stress- 
energy tensors supports collapse to a single timelike worldline as 

A ^ 0 (see, e.g., [28]). Suppose that these families are chosen such that a body’s 
physical size and mass asymptotically scale like as A ^ 0, suggesting that 
^qav-an scales like for all ^ > 0 and that ^jcv-cnab series ^^+1 fQj^ 
n >2. These conditions guarantee that the multipole series can be truncated at low 
order [as has already been assumed in (159)]. 

It is also important to demand that the time dependence of aP remain smooth as 
A ^ 0. This guarantees that a body’s internal dynamics remain slow compared to 
its light-crossing timescale, which is required to ensure that a charge does not self- 
accelerate in the absence of any external influence. Self-acceleration could occur if, 
e.g., internal oscillations conspired to generate strongly-collimated beams of radia- 
tion. Such cases are physically possible, but are typically excluded from self-force 
discussions. 

The A ^ 0 limit of the approximation which has just been sketched results 
in a timelike worldline satisfying the usual equations for a monopole test charge 
accelerated by 0ext- Effects typically described as self-forces occur at the following 
order. In principle, interactions between dipole moments and 0ext also occur at this 
order. These are dropped here for simplicity, leaving only (159). Without entering 



^^The two-point scalar V is to be understood here as equivalent to G± when its arguments are 
timelike-separated. This defines it even in the presence of caustics and other potential complications. 




(160) 



0 = 0ext + l [ p\G- - G+ + V)dy'. 



O / ' 

^ Jm 



(161) 
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into technical details, the appropriate approximation for the effective field in this limit 
may be shown to be given by (161) with a point particle charge density. Dropping 
all labels involving A, 



<^(x) = (j>eyx{x) + J[G-(x, 



7^0 - G+(x, 7^0 + V(x, 75')]ds' (162) 



plus terms of order A^. Unlike the A ^ 0 limit of 0, the limit of 0 is well-defined 
even at the particle’s location. The same is also true of its gradient. All necessary reg- 
ularizations have automatically been taken into account by first deriving the correct 
laws of motion for nonsingular extended bodies. 

The computations needed to evaluate at js are rather tedious, so we merely 
cite^^ the result [10]. Defining the projection operator := -h 7^ 7^ and 
assuming that q = (constant), 

v> = VVext + l(q^/m)h“‘’Ys[yb^c<Pext - 2(^/m)Vi,</.extV,</.ext] 

+ ^q(Rj^ - 2h‘^^Rbcl';) + q lim f V“G_(7„ 7,0d5'. (163) 

iz e^0+y_oo 



Substituting this into (159), the final equations of motion are 

Ts = -{q/m)h‘^'’^b4>ext + ^(q^/m)Ys['^b"^c4>ext - 2{q/m)Vb4>ext"^c(f>ext] 



-qRbcis + q 1™ 

o 



im r 

>o+y_( 



Vi,G_(7i, 7^0d^ 



') 



(164) 



and 



in - q(j>ext +q^\ ^R - lim [ G_( 7 ^, 7 i')ds' ) = (constant) (165) 

\12 e^O+J-oo } 

through first order in A. Essentially^^ the same results were first obtained by Quinn 
using an axiomatic argument [42]. Their derivation from first principles was later 
discussed in [2]. The integrals over the tail of G_ indicate that a charge’s motion 
can depend on its past history. This is related to the failure of Huygens’ principle. 



^^The point particle field derived in [10] includes a derivative of the particle’s acceleration. A careful 
treatment of the perturbation theory shows that such terms refer only to accelerations at lower order 
[28]. The self-consistent discussion which is implicit here therefore requires that accelerations be 
simplified using the zeroth order equation of motion. This is taken into account in (163). 

^^Some sign conventions in [2, 42] are different from those adopted here. 
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in which case the field a body sources in the past can scatter back towards it at later 
times. This effect disappears for matter coupled to massless fields in fiat spacetime, 
but is almost always present otherwise. 



4.3 Electromagnetic Charges 

The problem of electromagnetic self-force has inspired considerable discussion over 
the past century. Here, we show how it can be understood using a straightforward 
application of the formalism just described for scalar charges. The body of interest 
is assumed to be smooth and to be confined to a finite worldtube C Ad in a 
fixed four-dimensional spacetime (Ad, Qab)- H couples to an electromagnetic field 
Fab = ^[ab^ satisfying Maxwell’s equations 



from which it follows as an integrability condition that the body’s 4-current must 
be divergence-free. Its total charge 



is therefore a constant independent of 53^ (as long as this hypersurface completely 
contains the body of interest). Furthermore, the stress-energy tensor associated with 
Fab can be defined throughout via 



Identifying all remaining stress-energy as stress-energy conservation implies 

that 



^[a^bc^ — 0, V Fab — 



(166) 




(167) 




(168) 






(169) 



The right-hand side of this equation is the Lorentz force density. 



4.3.1 An Electromagnetic Momentum 



A generalized momentum may again be defined using (131). This requires a choice of 
worldline Z and a foliation {53^}, leading to a vector which resides in Kq(Z, {^ 5 }). 
Applying (169) shows that forces and torques follow from 
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+ FabCj'’^ d5. (170) 

This is the simplest approach. It is not, however, the only reasonable possibility. Ps 
has the unfortunate property that symmetries do not necessarily imply conservations 
laws. Even if there exists a vector field satisfying C^Fab = = 0, the 

associated momentum Pg is not necessarily conserved. 

Dixon [6, 24, 43] has proposed a different set of linear and angular momenta in 
this context which do form simple conservation laws in the presence of symmetries 
(among other desirable properties more generally). Translating his definitions into a 
generalized momentum P^ results in 

pP{0:=PsiO+ [ dSaJ^ix) [' duu-^a‘’'{y'^,Zs)e'iyu)Fbviyu)- ( 171 ) 
753, Jo 

The curve {y^ | u e [0, 1]} describes an affinely -parametrized geodesic satisfying 
yQ = Zs and yi = X. The correction to Ps which is used here represents a type of 
interaction between the electromagnetic field and its source. It can be motivated using 
symmetry considerations [24], the theory of multipole moments [6], or Lagrangian 
methods [43]. Some intuition for this interaction may be gained by introducing a 
vector potential Aa so that Fab = Then, 

P? + q(Aae)\,^ = Ps+ j^dSaJ‘'{AbS!’ - f^duu-^a^'C^Ab^. (172) 

Although this has been written in terms of a gauge-dependent vector potential, it is 
manifest from (171) that Ps and Pf are both gauge-invariant. 

Combining (172) with (170) shows that the generalized force associated with 
Dixon’s momentum is 

^(Pf + qAan =J^dS 

- — [ dSaJ“ f duu~^a‘‘'C(Aa>. (173) 

d.? 7®, Jo 

This is awkward to write more explicitly without introducing additional technical 
tools. Even so, it is simple to temporarily consider special cases where there exists 
a Killing vector field satisfying C^Fab = JJ"ii)9ab = 0- It is then possible to 
find a vector potential such that C^A^a^ = 0. The component of momentum 
conjugate to is therefore conserved in the sense that 



Pf{jp) + qA^^\^ = (constant). 



(174) 
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where is understood to be evaluated at Zs • Although this is reminiscent of the 

canonical momentum associated with a pointlike test charge, it is valid for essentially 
arbitrary extended charge distributions. 



4.3.2 The Self-field and Self-force 

Electromagnetic self-forces may be defined and removed from either Pg or P^. In 
both cases, it is convenient to work in the Lorenz gauge: 

= 0. (175) 

Maxwell’s equations then reduce to the single hyperbolic equation 

v‘’^bAa - Rj’Ab = -47t4. (176) 

Introducing the parallel propagator -^0 [10]. consider a Green function Gaa' 

satisfying 

1- ^Gaa' - Ra^Gba' = v'), 

Gaa'iA”) -X ) = Ga'aiA •> x)? 

3. Gaa'(.x, x') = 0 when x, x' are timelike- separated. 

These are the closest possible analogs of the constraints used to define G in Sect. 3.4. 
They characterize the S-type Detweiler- Whiting Green function for the reduced 
Maxwell equation (176). In terms of the advanced and retarded Green functions 
G^^/, there exists a homogeneous solution Vaa' such that 

Gaa’ = 1(G+ , + - Vaa’) = \l9aa’ - Vaa’@{a)]. (177) 

Although it can be difficult to find Gaa' explicitly in a particular spacetime, we 
assume that it is known. 

A self-field may be defined by convolving with Gaa' • Considering a space- 
time volume 93 c 2H, let 



A^m] := f Gaa’r'dV. (178) 

In the scalar case, the analog of this expression represents the “self-field” sourced in 
the region 93. The interpretation here is somewhat more obscure, as the restriction of 
to arbitrary regions is not necessarily conserved and is therefore unphysical. This 
definition is nevertheless useful. Without specification of any particular region, it is 
to be understood that 93 = so = A^[2H]. This (full) self-field is sourced by 
a conserved current, implying that := 2V[^A^j satisfies the complete Maxwell 
equations 
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V[aFl^=Q, V^F^=AT,Ja. (179) 

The same cannot necessarily be said for fields where 91 ^ W. 

Now consider changes in Ps due to A^. Applying the arguments of Sects. 3.4 and 
3.5 to (170) shows that the self-force has the form (85) with 



f{x,x') = (180) 

The average /(x, x') -\- f{x' , x) of action-reaction pairs reduces in this case to 

J^J^'C^Gaa' - ~ (181) 

The divergences which appear here are new features of the electromagnetic problem. 
Only their integrals matter, however, so they are easily dealt with. Defining the 
homogeneous effective field 



hb ■■= Fab - F^^, (182) 

the final result is that 

+ ej”tb + 1 Jjiv'j‘^j‘^'c^Gaa ^ , (183) 

where Ps := Ps + Es and 

+ f (eAl)j‘’dSh. (184) 

The renormalized momentum Ps therefore evolves via Fab rather than Fab- As 
explained in Sect. 4.2.1, the forces involving C^Qab and C^Gaa' in (183) combine in 
a natural way to form an effective gravitational force. Furthermore, Es is known to 
reduce in simple cases to the expected expression involving the stress-energy tensor 
associated with F^^ [3]. More generally, it should be thought of as a quasi-local 
functional of . 

A very similar result may be derived using the Dixon momentum P^ . This is 
most easily accomplished if a new self-momentum is introduced which satisfies 

f duu~^a‘’'c'F^,^,. (185) 

7 03 , Jo 



Defining P^ := P^ -F Ef, it follows that 
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+ ej”tb + \ JjV'j‘‘J‘^'C^Gaa^ 

+ — f dSaJ“ [ dwM“V^'f' Acs (186) 
ds 7®, Jo 

or equivalently 

A(^D + qXan = + 1 IjV'j^'r’C^Gaa'^ 

[ dSaJ^ [ duu~^a^ jC^Aa' 

dsj^^ "Jo ^ " 

(187) 



for any vector potential Aa satisfying Fab = 2V[qAt,]. 

If a Killing vector field exists which satisfies C^Fab = jC.^9ab = 0, it is 
possible to choose a vector potential for Fab with the property that C^A^a^ = 0. 
It then follows immediately that Pf + qA^a\^ is conserved. No similarly-simple 
conservation law is associated with Ps . 



4.3.3 Multipole Expansions 

Integral expressions for the generalized force are not particularly useful on their own. 
It is instead more interesting to consider their multipole expansions. Unlike in the 
scalar theories discussed earlier, more than one “reasonable” force may be considered 
in the electromagnetic case. As a matter of computation, it is simplest to expand the 
force associated with Pg. Dixon’s momentum is otherwise more attractive, however. 
Multipole series for the associated forces and torques have already been derived in a 
test body approximation [6] . The mechanics of the calculation are exactly the same 
here, and result in 

1 -j OO -j 

~ 2 S ^^9ab,cv-Cn + 9^ab^"z^s 

^ n=2 

OO 

+ X ( 188 ) 

n=l ^ 

The coefficients represent 2'^-pole moments of as defined (using the 

notation by Dixon [6, 44]. They are not to be confused with the scalar charge 

moments satisfying (99). The appearing in (188) represent 2^^ -pole moments 

of renormalized by Lie derivatives of Gaa' • Again, these are different from 

the renormalized stress-energy moments which appear in the scalar law of motion 
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(156). In the limit of negligible self-interaction, however, both definitions reduce to 
Dixon’s stress-energy moments [6]. For reference, the first tensor extensions of Fab 
are explicitly 

Fab,c = ^cFab, Fab,cd = ^(c^d)Kb ~ -Ff[aRb](cd/- ( 189 ) 

If there exists a Killing vector which satisfies C^jjFab = = 0, it 

has already been stated that is conserved exactly. It is evident 

from (188) that this quantity is also conserved in any consistent truncation of the 
multipole series. If a particular is Killing but does not necessarily preserve Fab, 
all gravitational terms vanish from (188) and 

= qPabri'l + Z (190) 

n=l ^ 

This holds for all e K Kg - It is all that arises for charges moving in fiat or de 
Sitter spacetimes. 



4.3.4 Linear and Angular Momentum 

Linear and angular momenta may be extracted from using the methods described 

in Sects. 2.2.6 and 3.7. Let and be defined by 

P? = P‘^ia + (191) 

Differentiating this and varying over all generalized Killing fields shows that 



d^ 



ab 



1 DiS*' 

= qF‘^bz'1 + + F", (192) 



where 



1 1 

~ 9 ^''^^'^adbc,dvdn + aPbc\,C2--Cn^ (193) 



n=2 



^ (« + !)! 



n=2 






n=0 



( 194 ) 



382 



A.L Harte 



Electromagnetic forces and torques defined in this way first arise at dipole order. 
The (monopole) Lorentz force depends — unlike any other electromagnetic terms — 
on Zs- It has therefore been separated out explicitly in (192). There is no similarly 
velocity-dependent electromagnetic torque. 

Defining the force to exclude the Lorentz force has the advantage that if is 
Killing and preserves Fab, the associated conservation law implies that 

Far + \Nab^‘^r = 0. (195) 

This does not involve q, and is directly analogous to the scalar result (106). 



4.3.5 Center of Mass 

The electromagnetic laws of motion (192)-(194) depend on both the foliation {93^} 
and the worldline Z. Center of mass conditions may be used to fix these structures 
as described in Sect. 3.8. An evolution equation for the center of mass position 7 ^ 
can then be obtained by differentiating Pa^ = 0. The result differs slightly from 
(111) due to the additional velocity-dependence associated with the Lorentz force. 
Adapting the methods of [3 1], the electromagnetic momentum- velocity relation may 
be shown to be 



= p“ - N\u'’ 



^‘’[mFh + (r - N%u''){qFbc - \RbcdfSdf)] 
nfi -^SP^^ipFpq 2^pqrs^'^^) 



(196) 



when 5 ' has been chosen such that 7 ^ pa = —m and is the unit timelike vector 
satisfying p^ = mu^ . 

Although a precise set of assumptions which imply this are not known, it is 
assumed here that the center of mass condition ( 111 ) admits a unique timelike solution 
in a sufficiently broad class of physical systems. Inspection of (196) shows that a 
breakdown of the center of mass condition must occur whenever 

- \s'’^ {qhc - \RbcdfS<>^^ = 0. (197) 

This is sufficient but not necessary. While the ^ = 0 case was dismissed in Sect. 3.8 
as unlikely to be physically relevant, the charged case is potentially more interesting. 
Consider an electron^^ in a magnetic field of order B. Setting q = e, m = nie, 
Rabc^ = 0, and S = fi/2, the denominator in (196) can diverge when B ^ lm\/eh ^ 
10^"^ Gauss. This may be viewed as the field strength at which an electron’s dipole 
energy KB becomes comparable to its rest mass. Quantum mechanically, it is also 



^^It is unclear that there is any sense in which an electron’s behavior can be modeled using equations 
derived for classical extended charges. Nevertheless, the example appears to be suggestive. 
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the field strength at which the separation between (Landau) energy levels becomes 
comparable to the rest mass energy. Indeed, this is the scale at which quantum 
electrodynamics is expected to become dominant. Although systems with 10^^ Gauss 
magnetic fields are far from direct laboratory experience, such fields are believed to 
exist around some neutron stars [45]. Even in somewhat smaller magnetic fields, 
hidden momentum effects predicted by the classical theory can become very large. 
Whether or not this has qualitative consequences for neutron star astrophysics is an 
open question. 

If the center of mass can be defined and the classical laws of physics remain valid, 
(196) may be combined with (192)-(194) to yield very general laws of motion. As 
in the scalar case, can also be reduced to a single spin vector satisfying 
(113). Similarly, the mass varies according to (1 14). Unlike in the scalar case, matter 
coupled to electromagnetic fields can change mass only at dipole and higher orders. 
Such effects are related to changes in a body’s internal energy due to work performed 
by (or against) the ambient fields [24] . 



4.3.6 Monopole Approximation 

In simple cases, the laws of motion governing an extended charge distribution may 
be truncated at monopole order. Within this approximation. Fa = Nab = 0 and (192) 
reduces to 



Dp^ 

d^- 



TT’Ci ' b I D Cl cbc ‘ d 

= bis + S 1^ , 






(198) 



Contracting the second of these equations with Sab while using (110) again shows 
that S'^ := S^Sa = (constant). An object which is initially not spinning therefore 
remains non-spinning in this approximation. Consider these cases for simplicity. The 
momentum- velocity relation then reduces to the mass remains constant, 

and the body accelerates via the Lorentz force law 



my; 






(199) 



in the effective electromagnetic field Fab- The effective field always satisfies the 
vacuum Maxwell equations. It is generically distinct from the physical field Fab 
which governs the acceleration of nearby test charges. 

In many cases of interest, it is useful to model Fab as the sum of some external 
field F^^ and the retarded field associated with a body’s charge distribution: 

Fab = Flf + 2j V[„G-,,7"'dV. 



( 200 ) 
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In these cases, it follows from (177) and (182) that the effective field must satisfy 

Fab = Flf + - G+,, + Vh^y)j'’'AV'. (201) 

Performing a point particle limit as discussed in Sect. 4.2.4 and [3, 28], the lowest- 
order self-interaction effects follow from (201) with a pointlike current. Through first 
order in the expansion parameter A, 

Fab = FX + qj V[,(G-,, - G+,, + Vbw)Xds'. (202) 

Evaluating this on Xs and recalling the projection operator -h 7^7^, it may 

be shown that [10] 

Fab = fX + Iqii9llahbf^[iq/A)il + {qjmfgf'^FlfFX 

+ l^crf] + 2^1im f (203) 

z e->0+J-oo 

Substitution into (199) finally yields the equation of motion for a self-interacting 
“pointlike” electric charge: 

= qg‘^^FXYs + ^iq^ /m)h‘^^X[X^4FX + iq/m)g‘^^ FXF}f] 

+ \q\‘^^RbcX + 2q^ lim f V[«G-.,77;d/. (204) 

J e->0+y_oo 

In curved spacetime, this is a reduced-order version — see Footnote 22 — of a result 
first obtained by DeWitt and Brehme [21] (with corrections due to Hobbs [46]). The 
second line of (204) vanishes in fiat spacetime, leaving only 

mX = qg^^FXX + \ici^lFi)h-^XVii^dFX + {qlm')g‘^^ FXFf^. (205) 

This is essentially the Abraham-Lorentz-Dirac equation [8], but in a reduced-order 
form typically attributed to Landau and Lifshiftz [47]. 

While instructive, the neglect of spin and electromagnetic dipole effects in (204) 
can be overly restrictive. This restriction is easily dropped by making use of the 
general multipole expansions derived above. The resulting changes are qualitatively 
significant [3] (see also [28]): A hidden momentum appears, external fields may 
change an object’s rest mass, and the spin magnitude may change due to torques 
exerted by . 
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For monopole point particles in flat spacetime, the use of Fab instead of Fab in 
the laws of motion was first suggested by Dirac [48]. The appropriate generalization 
for monopole particles in curved spacetimes was obtained much more recently by 
Detweiler and Whiting [29]. Both of these proposals were essentially physically- 
motivated axioms intended to define the dynamics of point charges. The discussion 
here, which follows [3], shows that these regularization schemes are actually limits 
of laws of motion which hold rigorously for nonsingular extended charge distribu- 
tions. Once the general laws of motion (192)-(194) and (196) have been derived, 
more explicit results such as (204) follow very easily. So do their spin-dependent 
generalizations. 



4.4 General Relativity 

All results discussed up to this point assume that the spacetime metric is known 
beforehand and has been fixed. This assumption may be relaxed. Doing so allows 
the consideration of self-gravitating masses in general relativity. We take a mini- 
mal approach to this problem by adapting the techniques of the previous sections as 
closely as possible. Although there are some deficiencies to this approach, consid- 
erable progress can still be made. 

Let the spacetime be described by (M, gab) and the body of interest be contained 
inside a spatially-compact worldtube W d M. Only the purely gravitational problem 
is considered here, meaning that objects can interact with each other solely via 
their influence on the metric; electromagnetic and similar long-range interactions 
are excluded. It follows that inside Letting gab be a solution to 

Einstein’s equation 

^ab ~ ^gab^ ~ ’ (206) 



it follows that 



^bT^o% = 0 - ( 207 ) 

These two equations replace, e.g., (73) and (77) used to understand the motion of 
scalar charges. Unlike in the scalar or electromagnetic cases, the gravitational law of 
motion (207) is a consequence of the field equation, not an independent assumption. 

Both and the metric inside W are assumed to be smooth. This precludes 
the consideration of black holes. Although unfortunate, it appears difficult to remove 
this restriction in a non-perturbative theory which describes the motions of individual 
objects (and not only characteristics of the entire spacetime). While quantities such 
as momenta might be associated with a black hole horizon [49, 50], adopting such 
definitions typically excludes the discussion of objects without horizons. Alterna- 
tively, one may consider an effective background and compute momenta associated 
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with, e.g., the Landau-Lifshitz pseudotensor [51, 52]. This is troublesome as well. 
Nevertheless, the motion of black holes can be sensibly discussed within certain 
approximation schemes [10, 53, 54]. These consider only the metric outside of the 
body of interest, and apply versions of matched asymptotic expansions in appropriate 
“buffer regions.” We instead consider internal metrics as well as external ones, but 
do not require the existence of a buffer region. 

As in other theories, the laws of motion derived here involve an effective field 
which is distinct from the physical one. In general relativity, the relevant field is 
the metric. We therefore use two metrics, which makes index raising and lowering 
ambiguous. All factors of the appropriate metric are therefore displayed explicitly in 
this section. 



4.4.1 Generalized Momentum 

As in scalar and electromagnetic systems, the first step to understanding the motion of 
a self-gravitating mass is to write down a generalized momentum. It is this momentum 
which is used to describe an object’s large-scale behavior. Introducing a foliation 
{Q 35 } of 2IT and a worldline Z, the linear and angular momenta proposed by Dixon 
[6, 17, 24] are conjugate to generalized Killing fields constructed using the physical 
metric gab • Adding an extra argument to Kg to reflect this, the appropriate generalized 
momentum is 



For each s, this is an element of the ten-dimensional vector space K^(Z, {^^}; g). 
The associated linear and angular momenta have a number of useful properties 
[6, 31, 55, 56]. Using (207), their time evolution satisfies 



If C^gab varies slowly throughout 53^, forces and torques can be expanded in multi- 
pole series as described in [4, 6] and in Sect. 4.2.2. 

While such assumptions can be useful for test bodies, they are too strong for self- 
gravitating masses . Moving beyond the test body regime first requires the introduction 
of an effective metric gab such that — after appropriate renormalizations — the C^gab 
appearing in (209) can be replaced by C^gab- If ^^9ab varies slowly, the resulting 
integral for the generalized force can be expanded in a multipole series in the usual 
way. One additional subtlety which occurs in the gravitational problem is that even if a 
particular gab can itself be adequately approximated using a low-order Taylor expan- 
sion, the same can not necessarily be said for C^gab when g Kg(Z, {93^}; g). 
The generalized Killing fields associated with Dixon’s momenta involve the physical 
metric and all of its attendant difficulties. These difficulties are partially inherited by 
the generalized Killing fields used to define P^. 





(209) 
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One way out of this problem^^ is to choose a bare momentum Pg (0 defined by 
an integral which is structurally identical to (209), but where all are elements of 
Kg(Z, {93,}; g) rather than Kg(Z, |93,}; g). Let 

Ps(0--= [ T^Xabc^dSa, e ^ Kg{Z,{^s};9)- (210) 

We take this to be the bare momentum of a self-gravitating mass. The effective metric 
which appears here is to be regarded at this stage as an additional parameter. The 
generalized force dP , /ds follows from a trivial modification of (209). Furthermore, 
Dixon’s momenta are recovered in a test mass limit where gab ~ If there exists 
di e Kg(Z, {93,1; g) such that C^gab = 0. it is evident that Ps(^p) must be 
conserved. 



4.4.2 Self-fields and Laws of Motion 

There are many possible ways to extract an effective metric gab from the physical 
metric gab- The simplest generalization of the previous discussions involves a two- 
point tensor field Gaba'b'i^^ which satisfies 



Gaba'b' = G(^ab)a'P = Gab{a'P) 



( 211 ) 



and 



Gaba'b'i^^ -^ ) — Ga'b'abi^ ? •^)* 



( 212 ) 



For any such propagator, consider an effective metric gab defined via 



dab — dab + dab^ 



(213) 



where 



gl,m := j^Gaba'b'T^ydV (214) 

and g^^ = Note that the volume element in this “self-field” is the one 

associated with gab, not gab- Substituting (213) into the appropriately-modified form 
of (209) shows that the 5' -integral of the self-field’s contribution to dPs/ds has the 
form (85) with force density 



It would be more elegant to instead demand that (-^5 9) ^^dKc{Z, {03,}; g) be identical 

or otherwise closely related. Such an assumption would restrict possible relations between gab and 
gab, and is an avenue which has not been explored. 



388 



A.L Harte 



/ = + 2Va£,‘^Gb,a'b')- (215) 

Applying (87) and related results then transforms the law of motion into 

l/s = \j^ 1 l^dS l^dVXXT^^C^Gaba'b', (216) 

where = Ps + Eg and 

Es = \ (^I^T^%£^g!bm7]dV - . (217) 

Es is a functional of which effectively acts like the momentum of the self-field. 

If C^Gaba'b' is a linear functional of C^Qab^ the term involving C^Gaba'b' in (216) 
renormalizes a body’s quadrupole and higher multipole moments. In these cases, a 
multipole expansion of (216) yields 




The tensor extensions appearing here are extensions of gab in a spacetime with metric 
Qab- This means, for example, that gab,c = 0 and gab,cd = jRaicd/ghf [cf. (154)]. 

Equation (2 1 8) represents not a particular law of motion, but a class of them. This is 
because many different propagators may be found which satisfy (211) and (212), and 
whose Lie derivatives with respect to are quasi-local in C^Qab- Choosing different 
propagators with these properties leads to different effective metrics, different self- 
momenta, and different effective multipole moments. Any of these combinations 
satisfies (216), and also (218) when the appropriate gab is sufficiently well-behaved. It 
is of course preferable to choose a propagator such that the associated multipole series 
may “typically” be truncated at low order without significant loss of accuracy. This 
condition is vague. In the electromagnetic and scalar theories, a (rather imperfect) 
proxy was the requirement that the effective fields be solutions to the vacuum field 
equation. The analogous condition in general relativity would be Rab = 0, where 
Rab denotes the Ricci tensor associated with gab- This does not appear to be possible 
within the currently-considered class of effective metrics. 

We take a pragmatic approach and suppose that Gaba'b' is the Detweiler- Whiting 
S-type Green function associated with 

g ^c^dGaba'b' ~ [^9 ^ Ef(^ab) ” 1 ” 2 QabQ ^9 E fb^^G cda'b' 

= -167r(gac5M - ]jgabgcd)g\a’g‘^ b')hx. x'). (219) 
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This is essentially the prescription suggested in [5] . In terms of retarded and advanced 
solutions to (219), the S-type Green function satisfies 

Gaba'b' = + ^aba'b' ~ ^aba'bd (220) 

for some bitensor Vaba'b' which is an appropriate solution to the homogeneous version 
of (219). Somewhat more explicitly, 

Gaba'b' = ^gacObd ~ ^gab9cd)g\a'9‘‘ ~ ^Vaba'b'®(a). (221) 

Coupling this Green function with (213) and (214) defines gab in terms of gab- 
Unlike in scalar or electromagnetic theories, the definition of the effective field is 
highly implicit in general relativity. It reduces to a simple subtraction only in linear 
perturbation theory. More generally, the Green function itself depends on the field 
which one intends to find, and must be found by iteration or related methods. 

Despite the nonlinearity of the map gab 9ab, the effective metric does not nec- 
essarily satisfy the vacuum Einstein equation. The specific differential equation (219) 
is nevertheless inspired by Lorenz-gauge perturbation theory, and has the following 
desirable properties: 

1. If gab is sufficiently close to a background metric gab satisfying the vacuum 
Einstein equation Rab = 0, gab satisfies the vacuum Einstein equation linearized 
about gab- 

2. The differential operator is self-adjoint. 

3. The trace of (219) can be solved independently of the full equation. 

The first condition guarantees that the effective metric is reasonable at least in first 
order perturbation theory. Self-adjointness is useful because it allows the reciprocity 
condition (212) to be enforced. Einally, it is important for technical reasons to know 
the trace g^^ Gaba'b' of Gaba'b'- The form of (219) may be used to show that this 
satisfies 



9^^ Gaba'b' = QOa'b'^ (222) 

where Q is an S-type Detweiler- Whiting Green function for the nonminimally- 
coupled scalar equation 



Vfc + ]^R)g = 167t5(x, x'). 



(223) 
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4.4.3 Linear and Angular Momentum 

Using the Green function associated with (219) to construct gab and Ps, linear and 
angular momenta may be extracted in the usual way. For any g Kg{Z, {^s]\ q) 
and any Zs ^ Z, let 

= Pa{Zs,s)i\Zs) + (224) 

The angular momentum defined in this way is antisymmetric in the sense the 
^{a^^b)c _ Q Differentiating (224) using a covariant derivative associated with 
Qab shows that 

^ = -\Rab/z^,S^d + ^ = (rgbd - S^A)Pczi + N\, (225) 

d^- 2 d^ 

where 

1 1 

= 2 ^ (226) 

n=2 



and 



oo 

= X 

n=2 

jci-cnab represents the renormalized 2 ^^-pole moment of the body’s stress-energy 
tensor. Despite the notation, these are not the same as the moments appearing in 
the scalar and electromagnetic multipole expansions (156) and (188), which are 
renormalized differently. In the test body limit where Qab ~ 9ab^ (225)-(227) reduce 
to the multipole expansions derived by Dixon [ 6 ]. More generally, they show — if 
the multipole expansion is valid — that with appropriate renormalizations, a self- 
gravitating body moves instantaneously as though it were a test body in the effective 
metric Qab^ Note in particular that the derivatives of the momenta which appear in the 
evolution equations are derivatives associated with gab , not gab • This is a consequence 
of choosing the generalized Killing fields to be constructed using gab instead of gab- 



'.gRb . (227) 



4.4.4 Center of Mass 

A center of mass frame may be defined by choosing an appropriate foliation together 
with the worldline { 7 ^} which guarantees that PaS^b = 0 when = 7 ^. A body’s 
linear momentum is then related to its center of mass velocity via an appropriately 
“hatted” version of ( 111 ). 
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4.4.5 A Simple Case 

If dPs/ds is sufficiently small, a body’s quadmpole and higher multipole moments 
might be neglected. In these cases, it follows from (225) that the motion is described 
by the Mathis son-Papapetrou equations in the effective metric: 

^ 2^ = (.rhd - S^A)Pczi (228) 

Choosing = 7 ^, the squared spin magnitude S^bS^a is necessarily conserved. It is 
therefore consistent to once again consider systems with vanishing spin. Assuming 
that S^b = 0, 



D 



7 - 7 ? = 0 . 



(229) 



Non-spinning masses whose quadmpole and higher interactions may be neglected 
therefore fall on geodesics associated with gab- This generalizes the well-known 
result that small test bodies in general relativity fall on geodesics associated with the 
background spacetime. 

Moving beyond the test body limit, the difficult step is to compute gab- What 
is typically referred to as the first order gravitational self-force may nevertheless 
be derived by considering an appropriate family of successively-smaller extended 
masses. To lowest nontrivial order, the effective metric looks like the metric of a 
point particle moving on an appropriate vacuum background gab which is close to 
gab- Using overbars to denote quantities associated with gab and assuming retarded 
boundary conditions. 



9ab — 9ab + 2 ^ j ^^ciba'b' ^aba'b' ^aba'b')ls is • (230) 

This is well-behaved even on the body’s worldline. So is the connection associated 
with it, which may be computed using, e.g., methods described in [10]. Substituting 
the result into (229) recovers the MiSaTaQuWa equation commonly used to describe 
the first order gravitational self-force [5]. Comparisons have not yet been made with 
second order calculations of the gravitational self-force which have recently been 
completed using other methods [57, 58]. 



4.4.6 Future Directions 

The formulation of the gravitational problem of motion remains somewhat unsatis- 
factory. Most importantly, the effective metric which has been adopted here (and in 
[5]) is somewhat ad hoc. It is inspired by Lorenz-gauge perturbation theory, but this 
has no particular significance other than being one way to guarantee hyperbolic field 
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equations. More seriously, the gab defined here does not satisfy the vacuum Einstein 
equation except in certain limiting cases. This seems unnatural. It would be prefer- 
able if the general relativistic laws of motion were completely identical in structure 
to the laws satisfied by test bodies moving in vacuum backgrounds. A condition like 
= 0 would also suggest, at least intuitively, that the associated gab might vary 
slowly in a wide variety of physical systems. The importance of slow variation to the 
application of multipole expansions makes it extremely interesting to search for an 
effective metric which admits a multipole expansion like (218) while also being an 
exact solution to the vacuum Einstein equation. Although such a metric^^ has not yet 
been found, there are several promising routes by which progress might be made. 

The simplest conceivable modifications of the formalism described here retains 
the bare momentum (210) while altering the effective metric gab- It is trivial to 
accomplish this by, for example, modifying the differential equation satisfied by 
Gaba'b' or by introducing ^-point propagators similar to those in (44). It can also be 
useful to alter the functional relation (213) between gab, gab, and any integrals which 
may be present. Despite being very simple analytically, relating two metrics to one 
another via the addition of a second-rank tensor is geometrically rather awkward. 

Better-motivated functional relationships between the physical and effective met- 
rics may be more convenient. Geometrically, perhaps the simplest conceivable map 
between two metrics is a conformal transformation. If gab = ^^gab for an appropri- 
ate ^2, it is straightforward^^ to obtain an effective metric which exactly satisfies the 
trace ^ = 0 of the vacuum Einstein equation. This is not enough, however. More 
degrees of freedom are necessary. It may be possible to go further by combining 
an appropriate conformal factor with a “generalized Kerr-Schild transformation” so 
that 



9ab — ^ idab H” ^{a^b)) (231) 

for some 1 -forms la and ka which are null with respect to gab (and therefore null 
with respect to gab as well). Despite the simplicity of this expansion, there is strong 
evidence that it is very general: Given any analytic gab I a, k a) triplets can always 

be chosen, at least locally, which guarantee that gab is flat [59]. Although it is not 
known how such choices interact with the laws of motion, the possibility of a flat 
(or conformally flat) effective metric is intriguing. Among other benefits, it might 
eliminate the need for generalized Killing vector s.^^ 



^^It could also be interesting to consider reformulations where an effective connection is sought 
instead of an effective metric. 

Analyzing the effect of a conformal factor on the laws of motion is similar to considering objects 
coupled to a particular type of nonlinear scalar field. Despite the nonlinearity, such systems can be 
understood exactly using only minimal adaptations of the formalism used to analyze the (linear) 
Klein-Gordon problem. 

^^Quasi-local momenta have recently been proposed in general relativity which use isometric 
embeddings to lift fiat Killing fields into arbitrary spacetimes [60, 61]. See also [62] for a pro- 
posal which allows conformal Killing vectors to be introduced in geometries without symmetries. 
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Combinations of these observations can perhaps be combined to provide one 
(implicit) map gab 9ab with the desired properties. It is likely simpler, however, 
to instead use them to construct a continuous flow of metrics \gab which smoothly 
deforms gab = Q9ab into an appropriate gab = oo9ab- The A parameter is not neces- 
sarily physical, but might be interpreted roughly as the reciprocal of the influence of 
a body’s internal scales. Flows like these have the advantage that individual “steps” 
A ^ A -h dA can be viewed as (easily-controlled) linear perturbations. Indeed, it is 
straightforward to impose differential relations on the A-dependence of \gab which 
ensure that a flow removes any initial stress-energy as A ^ oo. This requires using 
a 1 -parameter family of Green functions xGaba'b' associated with Einstein’s equa- 
tion linearized about each \gab- Separately, it is also straightforward to construct 
flows which lead to well-behaved laws of motion. What is more difficult is to find 
a flow which accomplishes both of these tasks simultaneously. If this were found, 
varying A would likely vary an object’s effective metric, its effective momentum, 
and its effective multipole moments. While only the A ^ oo limit might be physi- 
cal, such variations are highly reminiscent of the running couplings which arise in 
renormalization group flows. 

Regardless, a great deal of freedom clearly exists and may be exploited to better 
understand the problem of motion in general relativity. The resulting insights may 
also shed new light on nonlinear problems more generally. 



5 Discussion 

The techniques described in this review provide a unified and largely non-perturbative 
formalism with which to better understand how objects move. Although these tech- 
niques have thus far been applied only to a handful of specific theories — Newtonian 
gravity, Klein-Gordon theory, electromagnetism, and general relativity — they are 
easily generalized. 

One of the central concepts employed here is what we have called the “generalized 
momentum.” This is used as a convenient observable with which to describe an 
object’s motion in the large, and represents a body’s momentum not as a tensor either 
in the interior of the spacetime or at infinity, but instead as a linear map over a more 
abstract vector space. This automatically takes into account the nonlocality inherent 
in the momentum concept and also makes explicit how particular components of the 
momentum can be “conjugate to,” e.g., symmetry-generating vector fields. 

Another important property of the generalized momentum is that it unifies a body’s 
linear and angular momenta into a single object. Given a generalized momentum, 
linear and angular components can easily be extracted. This process depends, how- 
ever, on extra information, namely a choice of “observer” in the sense of a preferred 
origin. This origin is arbitrary. It affects the angular momentum even in elemen- 
tary discussions of Newtonian mechanics, but more generally influences an object’s 
linear momentum as well. This has an interesting physical consequence: Mathisson- 
Papapetrou terms arise in the evolution equations governing a body's linear and 
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angular momenta due to the motion of the origin used to extract these components 
of the generalized momentum. Mathisson-Papapetrou effects are therefore kinematic 
in nature, arising from the changing “character” of each generalized (or genuine) 
Killing field at different points. 

Once a generalized momentum has been defined as a particular linear map on a 
particular vector space, stress-energy conservation may be used to derive its rate of 
change. The resulting generalized force is another linear map on the same vector 
space. Letting be a particular element of that space, generalized forces typically 
have the form 



where f represents some (not necessarily scalar) long-range field and p its source. 
93^ is an appropriate 3 -volume and dS an associated volume element. Integrals like 
these can be difficult to evaluate directly, so it is important to seek approximations 
in practical problems. The simplest such approximations involve some combination 
of test body and smallness conditions which guarantee that “varies slowly” 
throughout 93^ . A multipole expansion can then be performed to express F in terms 
of f and its derivatives as well as the multipole moments of p computed on a (largely 
arbitrary) worldline {zs}’ F = q(zs, s)C^(l)(zs) + q^(Zs, s)C^Va(l)(Zs) + • • • 

It is far more difficult to obtain useful multipole expansions when an object’s 
self-field can no longer be ignored. The potentially-complicated nature of p is then 
inherited by f via the field equation, and there is typically no sense in which 
can be approximated using Taylor-like expansions inside 93^. Coping with this is 
perhaps the main theoretical problem associated with self-interaction in the classical 
theory of motion. 

Self-interaction is dealt with here by considering methods which alter the inte- 
grand in (232) without affecting the integral as a whole (or affecting it only via terms 
which can be interpreted as renormalizations). Particularly useful for this purpose 
are nonlocal deformations f ^ f generated by appropriate classes of propagators. 
Although the cases discussed here have used 2-point Green functions associated with 
appropriate field equations, other types of propagators can be more useful in other 
contexts. Regardless, the large variety of possible deformations may be tailored to 
optimally simplify whichever problem is at hand. 

In particular, it is often possible to find a deformation f ^ f such that varies 
slowly even when does not. Appropriately-modified multipole expansions can 
then be applied much more generally than might have been expected. This leads to 
the main physical principle which dictates motion in each of the theories we have 
considered: Laws governing the motion of self -interacting masses are structurally 
identical to laws governing the motion of test bodies. The fields appearing in these 
laws are nontrivial, however. Objects generally act as though they were accelerated 
not by the physical fields (i.e., 0), but by certain “effective fields” instead. 




(232) 
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The use of effective fields to understand problems of motion is not new per se. 
Standard formulations of Newtonian celestial mechanics heavily rely, for example, 
on external gravitational potentials which are distinct from the physical potentials. 
What has been stressed here is that generalizing the external field concept (where the 
“external” label is replaced by the more appropriate “effective”) is similarly essen- 
tial for a simple understanding of motion even in highly non-Newtonian regimes. 
All classical results on the self-force can easily be recovered, for example, once 
the appropriately-formulated laws of motion have been derived. Even point particle 
limits of these laws are well-defined precisely as stated; they require no independent 
postulates or regularizations. 

The standard deformation 0 ^ 0 of the physical Newtonian gravitational poten- 
tial into its effective counterpart leaves forces and torques completely unaffected: The 
Newtonian self-force and self-torque both vanish. Other theories are not so simple. 
Writing generalized forces in terms of effective fields generally requires the intro- 
duction of compensating counterterms. It is only when these counterterms have a par- 
ticularly simple form that the associated effective field is likely to be useful. Indeed, 
we have considered systems where these terms act only to make a body’s momenta 
or other multipole moments appear to be shifted from those moments which might 
have been deduced using knowledge of a body’s internal structure. The details of this 
structure are rarely known in practice, in which case it is natural to “remove” residual 
forces and torques by appropriately redefining an object’s momenta or other mul- 
tipole moments. These are renormalizations. They affect generic extended objects, 
and are always finite in this context. Considerable effort has been devoted here to 
identifying renormalizations and interpreting them physically. 

The resulting techniques have shown that a large variety of renormalizations are 
possible even in simple theories. The effective 4-momentum of an electric charge 
may differ from its bare momentum not only in length (i.e., mass), but also in direc- 
tion. Spins and center of mass positions can be renormalized as well. Adding the 
additional complication of a curved spacetime, even the quadrupole and higher mul- 
tipole moments associated with a body’s stress-energy tensor may be dynamically 
shifted via the forces exerted by its self-field. 

Two general mechanisms have been shown responsible for these effects. Both of 
these are associated with generalizations of — or failures to generalize — Newton’s 
third law. One mechanism relates from a direct violation of this law, while the other 
arises from an inability to fully take advantage of “action-reaction cancellations.” 
The second of these is simpler and affects a body’s linear and angular momenta. It 
is associated with self-fields which are nonlocal in time, in which case forces are 
sourced in four dimensions but act on matter only in three-dimensional slices. If 
the propagators associated with these statements satisfy certain minimal constraints, 
the inability to construct action-reaction pairs in this context conspires to dynami- 
cally shift an object’s momenta. Such effects are essentially universal in relativistic 
theories, but can also be relevant for some non-relativistic systems. 

The second renormalization mechanism discussed here stems from more direct 
violations of Newton’s third law. Mathematically, it is related to the behavior of 
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the relevant propagators under Lie dragging. If, say, a self-field is defined in terms 
of a propagator G, and C^G depends only on C^cj) for some field 0, the multipole 
moments coupling to 0 are renormalized by the self-force. In the cases considered 
here, 0 was the metric and the relevant moments were those associated with a body’s 
stress-energy tensor. The same mechanism applied to a nonlinear scalar theory would 
instead renormalize a body’s charge moments. 

Despite the generality of these results, much remains to be learned. Besides the 
various technical details which remain open — some of which have been mentioned 
in the text — it would also be interesting to understand how the techniques developed 
here can be applied in new ways. It may be possible, for example, to adapt these 
techniques to systems where long-range fields couple to an object’s surface instead 
of its volume. Such problems arise when considering the motion of solid objects 
through fluids, among other cases. More generally, it might be possible to investigate 
problems which are not related to motion at all. Quantities similar to (232) occur in 
many fields of physics and mathematics, as do various types of regularizations and 
renormalizations. It appears likely that the methods developed here can be applied 
to better understand at least some of these systems. Such speculations have only just 
begun to be explored. 
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Spacetimes: An Introduction 
to Gravitational Self-Force 



Adam Pound 



Abstract In recent years, asymptotic approximation schemes have been developed 
to describe the motion of a small compact object through a vacuum background to 
any order in perturbation theory. The schemes are based on rigorous methods of 
matched asymptotic expansions, which account for the object’s finite size, require 
no “regularization” of divergent quantities, and are valid for strong fields and rel- 
ativistic speeds. Up to couplings of the object’s multipole moments to the external 
background curvature, these schemes have established that at least through second 
order in perturbation theory, the object’s motion satisfies a generalized equivalence 
principle: it moves on a geodesic of a certain smooth metric satisfying the vacuum 
Einstein equation. I describe the foundations of this result, particularly focusing on 
the fundamental notion of how a small object’s motion is represented in perturbation 
theory. The three common representations of perturbed motion are (i) the “Gralla- 
Wald” description in terms of small deviations from a reference geodesic, (ii) the 
“self-consistent” description in terms of a worldline that obeys a self-accelerated 
equation of motion, and (iii) the “osculating geodesics” description, which utilizes 
both (i) and (ii). Because of the coordinate freedom in general relativity, any coordi- 
nate desscription of motion in perturbation theory is intimately related to the theory’s 
gauge freedom. I describe asymptotic solutions of the Einstein equations adapted to 
each of the three representations of motion, and I discuss the gauge freedom associ- 
ated with each. I conclude with a discussion of how gauge freedom must be refined 
in the context of long-term dynamics. 
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1 Preamble and Survey 



Consider a small object moving through a curved spacetime. What path does it 
follow? At the level of undergraduate physics, the answer is satisfyingly simple: 
if the object is sufficiently small and light, it can be idealized as a test particle, 
which does not affect the geometry around it and which moves on a geodesic of 
that geometry. But what if we do away with that idealization? The real object does 
perturb the geometry around it; how, then, does the object move in that geometry, 
which it itself affects? 

In Newtonian gravity, we may ask the analogous question — how does a massive 
body move when it contributes to the gravitational field around it? — and we may 
happily provide an answer without yet leaving undergraduate physics: if the body 
is sufficiently spherical, it can be treated as a point particle located at its center of 
mass, and the motion of that center of mass is governed by the external gravitational 
fields produced by all other masses in the system; the object does not feel its own 
field, at least in so far as its center-of-mass motion is concerned. 

However, in general relativity, the situation becomes radically more complicated. 
Because of the nonlinearity of the Einstein equations, an extended object cannot, 
in general, be modelled as a point particle without invoking post-hoc regularization 
procedures [ 1 , 2 ]. Furthermore, in a curved background, an object, even an asymptot- 
ically small one, does feel its own field, for reasons discussed below (and elsewhere 
in these proceedings). Hence, the field is said to exert a gravitational self -force on 
the object. 

One might wonder if this effect is actually relevant in any realistic scenarios. The 
answer, unequivocally, is “Yes”. Suppose we are interested in a bound binary of 
widely separated compact objects of comparable masses mi and m2 moving slowly 
in each other’s weak mutual gravity. Each of the objects is small compared to the 
other scales in the system (for example, the typical orbital separation R, or the radius 
of curvature of mi’s field at m2’s position). In a Newtonian approximation, mi is 
subject only to m2’s Newtonian field, which at the position of mi exerts a force 
(per unit mass) ~ . But if one requires anything more accurate than the 

Newtonian approximation, the moment one steps to the post-Newtonian level, self- 
forces can no longer be ignored: mi’s field, which we can think of as scaling like 
modifies the Newtonian fields, giving rise to a post-Newtonian force 
per unit mass that scales like Fp^ ~ ~ m 1 m2//?^ [ 3 ] , similar in magnitude 

to any other post-Newtonian effect.^ 

In these systems just described, however, each object, while small compared to the 
radius of curvature of the ambient external field it finds itself in, is not small compared 



Tn a certain sense, an object’s mass affects its own motion even in a Newtonian binary. Each object 
follows a Keplerian orbit about the system’s center of mass, not about the center of the other object. 
Since the center of mass is shifted by the object’s own mass, the object affects its own motion; more 
plainly, mi influences its own motion by influencing that of m 2 . This is a more indirect effect than 
the type described above, but in practice, distinguishing it from any other post-test-body effect is 
nontrivial. See Ref. [4] for a discussion. 
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to the other object. What if we are interested in a case where there are truly only 
two scales? Take a binary system of compact objects of mass m and M satisfying 
m M, and specifically focus on the regime in which the orbital separation R is of 
order M. In this case, it seems the gravity of m must certainly have a very small effect 
on its own motion, and it must very nearly follow a geodesic of the metric of M. And 
yet, even in this scenario, m’s gravitational self-force cannot be neglected. Although 
the effect is very small over a few orbits, the system continually radiates energy in 
the form of gravitational waves (or equivalently, the self-force does negative work), 
causing the orbit to shrink, and eventually causing m to collide with M (or plunge 
into M, if M is a black hole). In other words, the self-force has long-term, secular 
effects on the motion which make it impossible to ignore. 

Both of these two types of systems — binaries of widely separated bodies whose 
mutual gravity is weak, and binaries of objects with very dissimilar masses, called 
extreme-mass-ratio inspirals (EMRIs) — are of increasing relevance to modern astro- 
physics. The prospect of directly detecting gravitational waves emitted from compact 
binaries, and extracting information about the binaries’ strong-field dynamics from 
those waves, has spurred an international effort to study them. In the case of widely 
separated bodies of comparable mass, the main method of analysis has been post- 
Newtonian theory,^ a historied subject modern overviews of which can be found 
in the review articles [5, 6] and the recent textbook [7]. In the case of EMRIs, 
the main method of analysis has been self-force theory; for summaries of efforts 
to model EMRIs, I refer the reader to the reviews [8, 9], the more up-to-date but 
brief survey [10], and the contributions of Babak et al., Wardell, and Vega to these 
proceedings. 

In this paper, I seek to provide a single, unified description of gravitational self- 
force theory. Roughly speaking, this formalism consists of a perturbative expansion 
in powers of the small object’s mass m. Although I will refer to EMRIs to motivate 
many of the methods and problems I discuss, my focus will instead be on foundational 
issues related to the problem of motion of a small object. My aim is to complement 
extant reviews [8, 9] by concentrating on three themes given limited attention in those 
reviews: (i) self-force theory at arbitrary perturbative order, (ii) differing ways to rep- 
resent perturbed motion, and the asymptotic expansions of the metric corresponding 
to each, and (iii) the relationship between perturbed motion and gauge freedom. My 
presentation mostly follows the methods and viewpoint of Refs. [11-17] though it 
takes additional inspiration from the work of Gralla and Wald [18-20] and the classic 
papers of Mino et al. [21] and Detweiler and Whiting [22, 23]. To avoid excessive 
length, rather than striving for complete self-containment, I will often refer the reader 
to Refs. [9, 11, 15, 16] for mathematical tools and technical details. 

Most of what I discuss could be applied to objects carrying scalar or electric 
charges, but for simplicity I restrict my attention to the purely gravitational case. 
I refer the reader to Refs. [9, 24-28] for discussions of self-forces due to scalar and 



^Of course, once the two bodies are sufficiently close to each other, they interact in a highly nonlinear, 
highly relativistic way. In that regime, one must use numerical relativity to solve the fully nonlinear 
Einstein equations for the system. 
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electromagnetic fields on fixed background geometries, and to Refs. [29, 30] for 
recent work on the coupled system in which the metric reacts to both the mass and 
the scalar or electromagnetic field. 

In the remainder of this introduction, I make an extensive survey of the main 
concepts and results of self-force theory, beginning with the standard picture of a 
point mass in linearized perturbation theory and proceeding to describe the more 
robust framework now available for studying the motion of extended (but small) 
objects at any order in perturbation theory. 

Notation. Throughout this paper, Greek indices run from 0 to 4 and are raised and 
lowered with a background metric Latin indices run from 1 to 3 and are freely 
raised and lowered with the Euclidean metric 5ij . Sans-serif symbols such as g^^ refer 
to tensors on the perturbed spacetime rather than on the background. A semicolon and 
V refer to a covariant derivative compatible with to the covariant derivative 

compatible with g^^, and to the covariant derivative compatible with an “effective 
metric” g^jj. e = 1 is used to count powers of m, and labels such as the in 
refer to perturbative order ; I freely write these labels as either super- or subscripts. 
I work in geometric units with G = c = 1 . 



1.1 A Point Particle Picture: The MiSaTaQuWa Equation 
and Its Interpretations 

1.1.1 Self-Interaction with the Tail of the Perturbation 

As mentioned above, a point particle stress-energy tensor is not a well-defined source 
in the nonlinear Einstein equations. However, it is a fine source in the linearized 
theory. Eor simplicity, assume our object of mass m is isolated, such that in some 
large region, we can take it to be the sole source of stress-energy in the system. Now 
consider a metric g^^ = g^y -h -h O(e^), where the background metric g^y is 
a vacuum solution to the nonlinear Einstein equations, and is the leading-order 
perturbation due to our small object. Linearizing the Einstein equations in /z^^, we 
obtain 



where linearized Einstein tensor, and is the 

leading-order approximation to the object’s stress-energy tensor. Now suppose that 
the object’s stress-energy can be approximated by 



6G^Ahh = SttT;*, 



( 1 ) 




7 



( 2 ) 
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which is the stress-energy of a point mass moving on a worldline 7 with coordinates 
in the background spacetime. Here ^ is the particle’s four-velocity, r is 

x4/^Q!_ Q!\ 

its proper time (as measured in and d(x, z) = ^ covariant delta 

distribution, with g being the determinant of 

Unlike the nonlinear Einstein equations with a point particle source, Eq. (1) has 
a perfectly well-defined solution. If we introduce the trace-reversed perturbation 
~ impose the Lorenz gauge condition = 0(e), 

then the linearized Einstein equation takes the form of a wave equation, 

E^,[h^] = (3) 

where 

( 4 ) 

□ = and is the Riemann tensor of the background. Assuming 

the existence of a global retarded Green’s function G x')^ for this wave 
equation, we can write the retarded solution as 

hl,{x- z)=4j =4j ( 5 ) 

where a primed index refers to the tangent space at the source point x'. 

Now, if the background were fiat, waves would propagate precisely on the light 
cone, and the retarded Green’s function would be supported only on points con- 
nected by a null curve: = 5^^, 6^, ^ ~ ~ ^ where (t, is a Carte- 

sian coordinate system, t — \x^ — x^' \ is the retarded time, and \x^ — x^' \ = 
J — x^'){x^ — x^') is the spatial distance between the points v and x' . How- 
ever, in a curved spacetime, Huygen’s principle no longer holds. Waves scatter off the 
spacetime curvature, causing them to propagate from a source point x' both on the 
future null cone of x' and within that cone. Correspondingly, the Green’s function at 
a point X has support both on the past null cone of x and within it. This implies that if 
we look at the field at a point x near the worldline, we can split it into two pieces: 
a direct piece, corresponding to the portion of the field that propagated to x from a 
point z^(Tret) along a null curve; and a so-called tail piece, corresponding to 
the portion of the field that propagated from all earlier points z^(r < Tret) on the 
worldline. The direct piece diverges like a Coulomb field, behaving as 1 / r (where r 
is a geodesic distance from the particle). The tail piece, on the other hand, is finite. 

A detailed analysis (such as the one presented in the bulk of this paper) reveals that 
at leading order, the mass m is constant, and the force on the particle vanishes; in other 



^My conventions for the Green’s function are those of Ref. [9]; Ref. [9] also contains a pedagogical 
introduction to bitensors, objects which live in the tangent spaces of two different points x and x' . 
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words, it behaves as a test particle in The same analysis applied at subleading 
order reveals that the direct piece of the field exerts no force on the particle, but the 
tail piece does exert a force, and the equation of motion is found to be 



where ^ projects orthogonally to the worldline, and 

the tail term is given by 



the integral covers all of the worldline earlier than the point z^(r) at which the force 
is evaluated. The bar atop again denotes a trace reversal. 

Equation (6) is termed the MiSaTaQuWa equation, after Mino et al. [21], who 
first derived it, and Quinn and Wald [24], who re-derived it very shortly thereafter 
using a very different, independent method. The intuitive picture to glean from the 
MiSaTaQuWa result is that the direct piece of the field is analogous to a Coulomb 
field, moving with the particle and exerting no force on it, in the same way the self- 
field exerts no force on a body in Newtonian gravity. Very loosely speaking, from the 
perspective of the particle, the tail, consisting as it does of backscattered radiation, 
is indistinguishable from any other incoming radiation. In other loose words, it is 
effectively an external field, and like an external field, it exerts a force. 

Much of this paper is devoted to showing how the MiSaTaQuWa result can be 
robustly justified, and higher-order corrections to it can be found, within a systematic 
expansion of the Einstein equations. As a byproduct of that analysis, in Sect. 4.1 
I will show that the setup of the linearized system with a point particle source in this 
section is rigorously justified — even for a non-material object such as a black hole, 
and even though at nonlinear orders the field equations cannot be written in terms of 
such a source. However, before moving on to those matters, we may say significantly 
more on the basis of the point-particle result. 

1.1.2 The Detweiler- Whiting Description: A Generalized 
Equivalence Principle 

In the original MiSaTaQuWa papers [21, 24], the authors noted that Eq. (6) appears 
to have the form of the geodesic equation in a metric g^^jj -h when that geodesic 
equation is expanded to linear order in However, is not in any way a nice 
field. It does not satisfy any particularly meaningful field equation, nor is it even 
differentiable at the particle [9] [despite superficial appearances in Eq. (6)]. 



^I refer the reader to Appendix 1 for the expansion of the geodesic equation in powers of a metric 
perturbation. 




( 6 ) 




(7) 
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Detweiler and Whiting provided a more compelling form of the MiSaTaQuWa 
result [22, 23]. Rather than splitting the retarded field into a direct piece and a tail, 
they split it as 



hU = h% + hll. ( 8 ) 

The singular field can be interpreted as the bound self-field of the particle. Like 

the direct piece of the field, it exhibits a 1/r, Coulomb divergence at the particle; 
but unlike the direct piece, it satisfies the inhomogeneous linearized Einstein equa- 
tion = — IbyrT^^^, bolstering its interpretation as a self-field. Similarly, 

the regular field improves on the interpretation of the tail: it includes all the 
backscattered radiation in the tail, but it is a smooth solution to the homogeneous 
wave equation = 0. Hence, more than we could of the tail, we can think of 

h^l as an effectively external field, propagating independently of the particle. From 
it we can define what I will variously call an effective metric or effectively external 
metric + eh^l. 

Fittingly, given this interpretation of h^l, Detweiler and Whiting showed that the 
MiSaTaQuWa equation (6) can be equivalently written as 

^ {2h^i,g - + 0(e\ (9) 

or (following Appendix 1) explicitly as the geodesic equation in the metric 



df^ 






( 10 ) 



where ^ is a covariant derivative compatible with ^ is the 

four- velocity normalized in and f is the proper time along as measured 
in Equation (9) is equivalent to Eq. (6) because on the worldline, h^l differs 
from only by (i) background Riemann terms that cancel in Eq. (9) and (ii) terms 
proportional to the worldline’s acceleration, which can be treated as effectively higher 
order because the acceleration is already e. 

Allow me to dwell longer on the interpretation of the regular field. Because is 
a smooth vacuum solution, at the particle’s position an observer cannot distinguish 
it from g^j^. Although a portion of g^j^ comes from the retarded field sourced by 
the particle, to the observer on the worldline, it appears just as would any metric 
sourced by a distant object. However, this interpretation of the effective metric as an 
effectively external metric is delicate. To effect the desired split into and 
both fields must be made acausal when evaluated off the worldline [9, 23]. More 
precisely, in addition to depending on the particle’s causal past, h^l{x) depends on 
the particle at spatially related points x' . So in that sense its interpretability as a 
physical external field is limited. Yet when evaluated on the worldline, h^l and its 
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derivatives are causal, and that is the sense in which appears as a physical metric 
on the worldline. 

I impress upon the reader the significance of these properties of they are 
what makes Eq. (10) a meaningful result. Although it may not be an obvious fact 
at first glance, any equation of motion can be written as the geodesic equation in 
some smooth piece of the metric. This is most easily seen by writing the equation of 
motion in a frame that comoves with the particle. In locally Cartesian coordinates 
{t,x^) adapted to that frame, such as Fermi- Walker coordinates [9], the particle’s 
equation of motion reads at = E/, where a^ = is the particle’s covariant 
acceleration and is the force (per unit mass) acting on it. Now suppose we were 
to define some smooth field h^^^ and a corresponding singular field 
In the comoving coordinates, the linearized geodesic equation in the regular metric 
+ h^^^, in the form analogous to Eq. (9), reads at = — i- No matter 
what force acts on the particle, the equation of motion at = Fi could be written as 
the geodesic equation in simply by choosing = 0 and h^^j / = 2F/ , 

for example. Besides those two conditions, the regular field could be entirely 
freely specified, and regardless of the specification we made, we would have defined 
a split = /z^j^ -h h^^^ in which /z^j^ is singular and exerts no force, and g^jj + h^^^ 
is a regular metric in which the motion is geodesic. 

Given this fact, it is of no special significance that the MiSaTaQuWa equation is 
equivalent to geodesic motion in some effective metric. But it is significant that the 
MiSaTaQuWa equation is equivalent to geodesic motion in the particular effective 
metric = g^^jj identified by Detweiler and Whiting, because of the par- 

ticular, ‘physical’ properties of that metric: g^^JJ is a smooth vacuum solution that 
is causal on the particle’s worldline. Because of those properties, we may think of 
the MiSaTaQuWa equation as a generalized equivalence principle: any object, if it is 
sufficiently compact and slowly spinning, regardless of its internal composition, falls 
freely in a gravitational field g^jj that can be thought of (loosely if not precisely) as 
the physical ‘external’ gravitational field at its ‘position’ . I stress that this is a derived 
result, not an assumption, as will be shown in Sect. 3. By that stage in the paper, the 
principle’s reference to extended, “sufficiently compact and slowly spinning” objects 
will have become clear. 

Another aspect that will have become clearer is the non-uniqueness of the effective 
metric and the limitations of interpreting it in a strongly physical way. Nonetheless, 
the split of the metric into a self-field and an effective metric will be a recurring theme. 
In many ways, the generalized equivalence principle just described is both the central 
tool and the core result of self-force theory. It is conceptually more compelling than 
the expression for the force in terms of a tail,^ it is less tied to any particular choice 



^Another compelling physical interpretation is provided by Quinn and Wald [24] . They showed that 
the MiSaTaQuWa equation follows from the assumption that the net force is equal to an average 
over a certain “bare” force over a sphere around the particle. (This assumption was later proved to 
be true in a large class of gauges [18, 19, 31].) In the language of Detweiler and Whiting fields, the 
force lines of the singular field are symmetric around the particle and vanish upon averaging, while 
the force lines of the regular field are asymmetric and add up to a net force on the particle. 
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of gauge [17], and it is often easier to use as a starting point from which to derive 
formal results. Perhaps most importantly, it is easily carried to nonlinear orders: 
as I describe in later sections, at least through second order in e, the generalized 
equivalence principle stated above holds true [14, 16]. 



1.2 Extended Bodies and the Trouble with Point Particles 



Although the point-particle picture seemingly works well within linearized theory, 
it is not obvious a priori how it fits within a systematic asymptotic expansion that 
goes to higher perturbative orders. For as I have reiterated above, one cannot use a 
point particle in the full nonlinear theory. Let me now expound on that point. 

Suppose we attempt to model the extended object as a point particle in the exact 
spacetime, with a stress-energy tensor 






/ 



m 



dz>^ dz^ , 

^ at 

dt dt 



( 11 ) 



where t is proper time on as measured in g^^. If we expand the metric as = 
Qiiu + 2n>o ll^^ linearized Einstein equation is exactly as described in the 

previous section. But at second order, severe problems arise. The second-order term 
in the Einstein equation G^^[g] = SttT^^ reads 

5Gf^^[h^] = -6^Gf^^[h\h^] -h (12) 



where h\ = ^ ^ Ia-o second-order term in 

There are two problems with Eq. (12). Most obviously, contains terms like 

As described in the preceding section, diverges as 1/r near the particle; hence, 
the stress-energy diverges in the distributionally ill-defined manner ^(5(r). Even if we 
could somehow mollify the ill behavior of this piece of the source, the other source 
term in Eq. (12) would still present a problem. 6^G^^[h^, h^] behaves schemat- 
ically as (dh^)^ + h^d^h^. Therefore, it diverges as 1/r^. Such a divergence is 
non-integrable, meaning it is well defined as a distribution only if it can be expressed 
as a linear operator acting on an integrable function. In the present case, there does 
not appear to be a unique way of so expressing it. 

These arguments make clear that a point particle poses increasingly worsening 
difficulties at nonlinear orders in perturbation theory. And it is well known that in 
any well-behaved space of functions there exists no solution to the original, fully 
nonlinear equation G^^[g] = SttT^^ with a point particle source [1, 2]. 
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Despite these obstacles, one might suppose that the point-particle model could be 
adopted and the divergences resolved using post-hoc regularization methods. Evi- 
dence for this reasoning is given by the successful use of dimensional regularization 
in post-Newtonian theory [5]. In the fully relativistic context, the most promising 
route to such regularization appears to be effective field theory [32, 33]. 

However, at a fundamental level, there should be no need for such regularization in 
general relativity. Outside of curvature singularities inside black holes, everything in 
the problem should be perfectly finite. For that reason, in this paper I will be interested 
only in formalisms that deal with finite, well-defined quantities throughout. 

So let us do away with the fiction of a point particle and think of an extended 
object. Perhaps the most obvious route, at least at first glance, to determining this 
object’s motion is to go to the opposite extreme from the linearized point particle 
model, by looking instead at a generic extended object in fully nonlinear general 
relativity; if one is interested in the case of a small object, one can always examine an 
appropriate limit of one’s generic results. This line of attack has been most famously 
pursued by Dixon [34] (inspired by early work by Mathisson [35]) and Harte [36]. 
Specializing to material bodies, Dixon showed that all information in the stress- 
energy can be encoded in a set of multipole moments, and all information in 
the conservation law = 0 can be encoded in laws of motion for the body. 

These laws take the form of evolution equations for some suitable representative 
worldline in the object’s interior and for the object’s spin about that worldline. A 
“good” choice of representative worldline may be made by, for example, defining 
the object’s mass dipole moment relative to any given worldline and then choosing 
the worldline for which the mass dipole moment vanishes, establishing the worldline 
as a center of mass [37]. However, in order to transform these general results into 
practical equations of motion, Dixon’s method requires an assumption that the metric 
and its derivatives do not vary much in the body’s interior. Given that assumption, 
the force and torque appearing on the right-hand side of the equations of motion can 
be written as a simple expansion composed of couplings of the metric’s curvature 
to the object’s higher multipole moments; the higher moments, beginning with the 
quadrupole, may be freely specified, and their specification is entirely equivalent to 
a specification of the object’s stress-energy tensor. Equations of motion of this form 
can be viewed in Eqs. (13) and (14) below. 

Unfortunately, for a reasonably compact, strongly gravitating body, the physical 
metric does not vary slowly in its interior, due to the body’s own contribution to the 
metric. To make progress, one must treat the object as a test body, an extended object 
that is non-gravitating but is equipped with a multipole structure — or, as discussed 
in Sect. 13 of Ref. [34], one must find an efficacious means of separating the object’s 
self-field from the “external” field, analogous to the trivial split in Newtonian theory 
and to the Detweiler- Whiting split in the linearized point particle model. A well- 
chosen “external” field will vary little in the body’s interior, a well-chosen “self-field” 
will have minimal influence on the motion, and one can hope to arrive at equations 
of motion expressed in terms of couplings to the curvature of the external field alone. 

In the fully nonlinear theory, finding such a split would seem to be highly non- 
trivial. Nevertheless, in a series of papers [26, 27, 38] culminating in Ref. [36] 
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(see also Harte’s contribution to these proceedings [28]), Harte has succeeded in 
finding a suitable split by directly generalizing the Detweiler- Whiting decomposi- 
tion. He has shown that the “self-field” he defines modifies the equation of motion 
only by shifting the values of the object’s multipole moments, and the object behaves 
as a test body moving in the effectively external metric he defines. This extends the 
Detweiler- Whiting result from the linearized model to the fully nonlinear problem. 
However, there is one caveat to this generalization: beyond linear order, Harte’s effec- 
tive metric loses one of the compelling properties of the Detweiler- Whiting field: it 
is not a solution to the vacuum Einstein equation. Despite this feature, Harte’s work 
is a tour de force in the problem of motion. 

The approach I take in this paper is complementary to Harte’s. Rather than begin- 
ning with the fully nonlinear problem, I will proceed directly to perturbation theory. 
There are several advantages to this. The perturbative approach naturally applies 
to black holes, while Harte’s formalism, because it is based on integrals over the 
object’s interior, is restricted to material bodies. The perturbative approach also nat- 
urally leads to a split into self-field and effective metric in which the effective metric 
satisfies the vacuum Einstein equation at all orders and is causal on the worldline. 
Most importantly, the perturbative approach provides a practical means of solving 
the Einstein equations. In the fully nonlinear approach, one arrives at equations of 
motion given the metric, but not a practical way to find that metric. 

In the next section, I will begin to discuss the perturbative formalism. Eirst, I 
note that with the work of Dixon and Harte, a new theme has been introduced: an 
object’s bulk motion can be expressed in terms of a set of multipole moments, and 
the f > 2 moments are freely specifiable. Like the decomposition of the metric into 
a self-field and effectively external field, this second theme will appear prominently 
in the remainder of this paper. 



1.3 When Perturbation Theory Fails Near a Submanifold: 

The Method of Matched Asymptotic Expansions 

As soon as we seek a perturbative description of the problem, we run into a new 
challenge. Say we assume an expansion of the exact spacetime (g^^. Me) about a 
background spacetime Mq), as in g^^ = -h -h O(e^), where all e- 
dependent terms are created by the small object (or by nonlinear interactions of its 
field with itself). This expansion assumes the object has only a small effect on the 
metric. Clearly, if the object is compact, this cannot be true everywhere: sufficiently 
near the object, where r ~ e, the object’s own gravitational field will contain a 
Coulomb term ~ m/r ~ — just as large as the background and because it 

varies on the spatial scale e rather than e^, having much stronger curvature than g^jj. 

This fact motivates the use of matched asymptotic expansions. At distances r ~ 
from the object (for example, r ~ M in an EMRI), we expand the exact spacetime 
around Mo), as above; I will call this the outer expansion. At distances r ^ e 
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from the object (or r ~ m in dimensionful units), we introduce a second expansion, 
+ 0{e^), where A^obj) is the spacetime of the object were 
it isolated, and the perturbations are due to the fields of external objects (and 
to nonlinear interactions); I call this the inner expansion. In a buffer region around 
the object, defined by e ^ r ^ e^, we assume a matching condition is satisfied: 
if the outer expansion is re-expanded in the limit r ^ and the inner expansion 
is re-expanded in the limit r ^ e, the two expansions must agree term by term in 
powers of r and e, since they both began as expansions of the same metric. The 
relationship between the various regions and expansions is shown schematically in 
Fig. 1, and it will be described precisely in Sect. 2. 

Historically, matched asymptotic expansions have been a highly successful way 
of treating singular perturbation problems in which the behavior of the solution 
rapidly changes in a localized region. For general discussions of singular perturbation 
theory in applied mathematics, I refer readers to the textbook [39], and for more 
rigorous treatments, to Refs. [40, 41]. For general discussions in the context of 
general relativity, I refer them to [12, 42]. 

In the context of spacetimes containing small objects, matched expansions 
have been the standard method of tackling the problem; Refs. [11, 14, 18, 20- 
22, 43^6] are but a small sample. When it comes to obtaining equations of motion. 




Fig. 1 The exact spacetime {top left), external background spacetime (top right), and inner back- 
ground spacetime {bottom), with maps between them. In the exact spacetime, we have some small 
compact object, here shown as a material body (in dark red and black), but possibly a black hole or 
more exotic object. Around the object is a buffer region, shown in orange. Points outside the object 
can be identified (via the identification map pf) with points in the external background manifold 
Mq. In a chart {t, x^) : M.q centered on a worldline 7 G M.q, the points p e M.q and 

Pe{p) £ -Ade are assigned the same coordinate values {t{p), x^{p)). The map o zooms in 
on the object by rescaling spatial coordinates; in the same chart {t, x^), the point q = 'ff^{pe{p)) 
is assigned coordinate values {t{q),x^{q)) = {t {p) , x\p) / e) 
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the method hearkens back to an early insight of Einstein and others [3, 47-49]: an 
object’s equations of motion can be determined from the Einstein equations in a 
region outside the object. Specifically, it can be determined from the field equations 
in the buffer region defined above. Relative to the object’s scale r ^ e, the buffer 
region e r ^ is at asymptotically large distances, allowing one to make an 
asymptotic characterization of how well “centered” the buffer region is around the 
object. One such characterization is based on a definition of multipole moments in 
the buffer region. Again, because the buffer region is at asymptotic infinity in the 
object’s metric we can define the object’s multipole moments by examining 
the form of the metric there, rather than having to refer to the object’s stress-energy. 
We can then use the metric’s mass dipole moment in the buffer region as a measure 
of centeredness: if we install a timelike curve 7 in the background spacetime and 
define the mass dipole moment in coordinates centered on that worldline, then 7 is 
a good representative worldline if the mass dipole moment vanishes. Centeredness 
conditions along these lines will be described in more detail in Sects. 2 and 7; for a 
schematic preview, see Eigs. 1 and 2. 

Prior to its application to self-force analyses [11, 14, 18, 20-22] , this program was 
pursued furthest by Thorne and Hartle [46] . Where Dixon stood relative to the later 
non-perturbative work of Harte, Thorne and Hartle stand in the same position relative 
to perturbative self-force constructions. As did Dixon, they derived general laws of 
motion and precession for compact objects. They considered an object immersed 
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Fig. 2 Perturbative motion of a small object as defined in the Gralla-Wald (left) and self-consistent 
(right) approximations. In the Gralla-Wald case, the buffer region is centered on a zeroth-order 
worldline 70 C Mo, which in a given coordinate system : Mo has coordinates Zq(s). 

In a generic gauge, the self-force drives the object (shown as a sequence of red circles) away 
from 70, and this deviation is represented by Zi(s), a vector defined on 70. In the self-consistent 
case, the buffer region is centered on an accelerated worldline 7^ C Mo, which in the same 
coordinates has coordinates z^(s, e). The accelerated worldline faithfully tracks the object’s 
motion, such that the object is always at the “center” of the region enclosed by the buffer. On 
timescales much smaller than the dephasing time, we have that the two approximations are related 
by z^(s, e) = Zq(s) + ezi(s) + O(e^) 
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in some external spacetime, say 9Jt), and found forces and torques made up 
of couplings between the external curvature and the object’s multipole moments; 
specifically, in Cartesian coordinates (t, that are at rest relative to a geodesic 
of the external spacetime (and in which the mass dipole moment vanishes), they 
found 



where and are the object’s linear and angular momentum relative to 
and are its “mass and current” quadrupole moments (see the ends of Sects. 3.2 
and 3.4), t is proper time (as measured in on and etjk is the fiat- space, 
Cartesian Levi-Civita tensor. The quantities (Bab and are the electric-type and 
magnetic-type quadrupole tidal moments of the external universe, which describe 
the tidal environment the object is placed in; they are related to the Riemann tensor 
of according to (Bab ~ and B^ab ~ 2^^^ (ci^b)ipq' 

Physically, Eqs. (13) and (14) say that the object moves as a test body in the 
external metric; the equations have the same structure as the test-body equations of 
motion mentioned above in the context of Dixon’s work [34]. Up to the coupling 
of the object’s moments to the tidal moments of the external universe, the motion 
is geodesic, and the spin parallel-propagated, in the external metric. But just as in 
Dixon’s work, these results become useful only once one knows how to split the full 
metric into a self-field and an effectively external metric. What Thorne and Hartle 
call the “external metric” is not but rather it is essentially defined to be whatever 
creates tidal fields across the object. As Thorne and Hartle themselves remarked, this 
“external” field includes a contribution from the object’s own field. (Likewise, the 
“object’s” multipole moments can be altered by the external field.) 

In this sense, the self-force game is played by finding a useful split in which Thome 
and Hartle’s general laws are valid. Doing so requires developing a systematic theory 
of matched expansions for spacetimes containing small objects, as described in the 
body of this paper. Using those matched expansions, we will find that at linear order, 
we can circle back to the point particle picture: as first shown by D’Eath [43, 44], the 
linearized perturbation in the outer expansion is identical to the linearized field 
sourced by a point particle (see also Refs. [11, 15, 1 8] for more refined derivations). 
And at that linearized level, we will find that the effectively external metric 
of Thorne and Hartle can be taken to be the Detweiler- Whiting effective metric 







(13) 



(14) 



9iiv — 9ni> + • 
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1.4 Gauge, Motion, and Long-Term Dynamics 

Despite all the above preparation, we are still not ready to broach the problem of 
motion in perturbation theory. Two additional facts must first be understood: in 
perturbation theory, motion is intimately related to gauge freedom [17, 19, 50]; and 
in problems of astrophysical interest, the most important dynamical effects occur on 
the very long time scale 1 /e [51]. 

To the first point. At leading order, the object’s motion is geodesic in the back- 
ground metric all deviation from that motion is driven by an order- e force. 
Suppose the self-accelerated worldline 7 is a smooth function of e. Then we can 
write its coordinates as an expansion 

e) = -h ezi{s) + O(e^), (15) 

where s is 3 . parameter on the worldline, and the zeroth-order term Zq is a geodesic 
of Now consider the effect of a gauge transformation. Under a transformation 
generated by a vector a curve z^ is shifted to a curve z'^ = z^ — + 0(e^). 

Nothing prevents us from choosing which leaves us with z'^ = Zq+ O(e^), 

entirely eliminating the first-order deviation from Zq. This same idea can be carried 
to arbitrary order, meaning we can precisely set = Zq. In other words, the effect 
of the self-force appears to be pure gauge. 

In one sense, this result is true. If we look at any finite region of spacetime and 
consider the limit e ^ 0 in that region, the deviation from a background geodesic 
is, indeed, pure gauge. This does not mean it is irrelevant: in any given gauge, it 
must be accounted for to obtain the correct metric in that gauge. But it need not be 
accounted for in the linearized metric. We can always substitute the expansion (15) 
into \ z) to obtain 

z) = ^o) + zo, zi) + O(e^), (16) 

and we can then transfer the term zo, Zi) into the second-order perturbation, 

[An explicit expression for ^l) is given in Eq. (221).] 

However, this analysis assumes we work in a fixed, finite domain — and as men- 
tioned in the first paragraph of this section, we do not typically work in such a domain 
in problems of interest. Consider an EMRI. Gravitational waves carry away orbital 
energy from the EMRI at a rate E/E ~ e. It follows that the inspiral occurs on 
the time scale Er E/E ^ 1/e, which is called the radiation-reaction time. So in 
practice, we are not looking at the limit e ^ 0 on a finite interval of time [ 0 , T], 
where T is independent of e; instead, we are looking at the limit e ^ 0 on a time 
interval [ 0 , T/e] that blows up. 

This consideration forces us to adjust our thinking about motion and gauge. 
Loosely speaking, the deviation from geodesic motion, ezp is governed by an 
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equation of the form 



On the radiation-reaction time scale, it therefore 



behaves as ezi ^ ~ 1/ e. In other words, it blows up in the limit e ^ 0. So on 

this domain, one cannot rightly write the worldline as a geodesic plus a self-forced 
correction, and one cannot use a small gauge transformation to shift the perturbed 
worldline onto a background geodesic; the gauge transformation would have to blow 
up in the limit e ^ 0. 

Because we are solving partial differential equations and not ordinary ones, these 
arguments about time scales translate into arguments about spatial scales. For exam- 
ple, if we seek a solution in a Schwarzschild background spacetime, fields (including 
inaccuracies in them) propagates outward toward null infinity along curves of con- 
stant r — r*, or inward toward the future horizon along curves of constant + r*, 
where r* is the tortoise coordinate. If one’s accuracy is limited to a time span A A 
then it is also limited to a spatial region of similar size. 

To organize our thinking, let us denote by a spacetime region roughly of size 
^(e) (both temporal and spatial). I call an asymptotic solution to the Einstein equations 
a “good” solution in Vc;{e) if h is uniform in V^;{e) • That is, the asymptotic expansion 



— 9fliy + Sn 



>0 



eh^ 

must satisfy lime^o = 0 and lime 



pn-lhn-l 



= 0 



uniformly (e.g., in a sup norm). 

For the EMRI problem, we are interested in obtaining a good solution in a domain 
T>i/e. Suppose we use an asymptotic expansion of the form (16) and incorporate 
zo, z\) into In a gauge such as the Lorenz gauge, grows as ~ 
and so \ ^i) likewise grows as Hence, on Pi/e, its contribution to 

behaves at best as e^, comparable to Clearly, this is not a good approximation. 
Suppose we instead eliminated z^ using a gauge transformation generated by = 
z^. This removes the offending growth in /z^^, but it commits a worse offense: it alters 
e/z^^ by an amount which behaves at best as et, or as on Pi/e- Hence, if 

we are in a gauge where the self-force is nonvanishing, /z^^ behaves poorly; if we 
are in a gauge where the self-force is vanishing, even behaves poorly. 

Let us chase the consequences of this. To obtain a good approximation in Vi /e, we 
need to work in a class of gauges compatible with uniformity in Pi/e- This means, 
in particular, that if we obtain a good approximation in a particular gauge — call it 
a good gauge — we must confine ourselves to a class of gauges related to the good 
gauge by uniformly small gauge transformations. In turn, this means that the effects 
of the self-force are not pure gauge on Pi/e- Due to dissipation, z^ will deviate 
from any given geodesic Zq by a very large amount in Di/e, but by using an allowed 
gauge transformation we may shift it only by a very small amount, of order e, on that 
domain. In other words, although the self-forced deviation from Zq is pure gauge on 
a domain like V^o, it is no longer pure gauge in the domain Di/e- 



^More precisely, it is governed by Eq. (35). 
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1.5 Self-Consistent^ Gralla-Wald^ 

and Osculating-Geodesics Approximations 

In the preceding sections, we encountered several core concepts pertaining to the 
motion of a small object: (i) the metric can be usefully split into a self-field and an 
effectively external metric in which the object behaves as a test body, (ii) the bulk 
motion of the object can be described in terms of forces and torques generated by 
freely specifiable multipole moments, (iii) by using matched expansions, all of this 
can be done in a vacuum region outside the object, where multipole moments can be 
defined from the metric and laws of motion can be derived from the vacuum Einstein 
equation, and (iv) the representation of the object’s bulk motion, the asymptotic 
expansion of the metric, and notions of gauge must all be tailored to suit the long 
timescales on which self-force effects accumulate. 

Let me now combine this information into a cohesive framework, mostly following 
Ref. [11] (but see also Refs. [14-17, 52]). The overarching method I describe consists 
of solving the Einstein equation with an “outer expansion” in some vacuum region 
outside the object, using only minimal information from the “inner expansion” 
to determine the behavior of the solution very near the object. 

Since we work in the vacuum region , we seek an asymptotic solution to the 
vacuum Einstein equation 



Ri,Ag\ = 0. (17) 

Eollowing the lessons of post-Newtonian theory, I write this equation in a “relaxed” 
form [5-7]. I define h^jj = g^^, — impose the Lorenz gauge condition^ 

LAh] ^ V^(V - = 0, (18) 

and write Eq. (17) with that condition imposed, making it read 

( 19 ) 

where is the wave operator introduced in Eq. (4) (but here acting on the metric 
perturbation rather than on its trace reverse),^ and the “source” = R^Aq + 
h] — E^Ah] is a nonlinear functional of h^jj. The background is chosen to be a 
smooth solution to R^Ag] = 0 (if matter exists outside the object, it is assumed to 
be sufficiently far away to lie outside T>^(e))- This makes Eq. (19) a weakly nonlinear 
hyperbolic equation for the perturbation at this stage, that equation is still exact. 



^Reference [15] describes how the entirety of this section can be performed in any gauge in which 
the linearized Einstein tensor is hyperbolic. Section 6 below offers a more general discussion of 
gauge. 

^In the Lorenz gauge in a vacuum background, the linearized curvature tensors are related by 
E^Ah] = -25R^Ah] = -25G^Ah] = E^Ahf 
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I wish to solve Eq. (19) in P^(e) subject to two types of boundary conditions: 

(BCl) Global boundary conditions. Examples of these are retarded boundary con- 
ditions or specified Cauchy data. 

(BC2) The matching condition. In the buffer region, the solution must be compatible 
with an inner expansion. 

Together these conditions ensure we are describing the correct physical situation. But 
they do not yet uniquely determine the solution. Equation (19) is called “relaxed” 
because, unlike Eq. (17), it can be solved no matter how the object moves; this 
relaxation arises because Eq. (19) is not constrained by the Bianchi identity, unlike 
Eq. (17). The object’s motion is determined only once the gauge condition is also 
imposed, thereby making the solution to the relaxed equation also a solution to the 
unrelaxed one. As discussed earlier, in the present problem, the motion of the object 
is defined by the mass dipole moment of the metric in the buffer region, and we 
will find that the evolution of that mass dipole moment is determined by the gauge 
condition. 

Beyond these broad ideas, the specifics of the method, and the size of the region 
Vc;{e) in which it applies, depend crucially on how one represents the object’s per- 
turbed motion, which determines how one formulates the asymptotic solution to the 
Einstein equation. Here I describe three representations and corresponding asymp- 
totic solutions: what I call “self-consistent”, “Gralla-Wald”, and “osculating geodes- 
ics” approximations. 

1.5.1 Self-Consistent Approximation 

In the self-consistent approximation, to avoid the secularly growing errors described 
in Sect. 1 .4, 1 seek to directly determine an accelerated worldline Ze (s) that represents 
the object’s bulk motion; I do not wish to expand that worldline in powers of e. To 
accommodate this, I write the perturbation as h^j^ix, e; Ze, Ze), where Ze = ^ 
and the quantities after the semicolon denote a functional dependence. I expand this 
functional as 



Despite the fact that Ze and Ze depend on e, they are not expanded; in other words, 
I hold them fixed while taking the limit e ^ 0. Later, I will suppress the functional 
dependence on Ze and simply write Ze)- 

The self-consistent representation of motion is given its name because Ze must 
be determined simultaneously with Ze)- It was the representation adopted in 

the original derivations of the MiSaTaQuWa equation [21, 24, 53], and it is the one 
I used in describing that equation in Sect. 1 . 1 . It was first put on a sound and robust 
basis, as part of a systematic expansion of the Einstein equation, in Ref. [1 1]. In this 
section, I outline that expansion. 




( 20 ) 



n>0 
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I first refine the region in which I seek a solution to the relaxed Einstein equa- 
tion (19). Install the timelike curve 7 e (with coordinates Ze) in the background space- 
time, and let D 7 e be a region of size <^(e). I define to be a region of proper 
radius centered on 7 e, with e ^ The region I seek a solution in is then 
27^^ . The inner boundary of this region lies in the buffer region, and 

there the solution must satisfy the matching condition (BC2). 



1.5.2 Field Equations 

Since the coefficients Ze) in the expansion (20) depend on e, it may seem they 

are not uniquely determined. However, here I define the functional e; z) to 

be the solution to the relaxed Einstein equation (19); the relaxed equation places no 
constraints on and each function : M ^ yields a different solution. (In 
the present context this means there are no constraints on the motion of the region 
B^.) The coefficients \ z) are then uniquely determined to be the solution to 
the /ith-order term in an ordinary power-series expansion of the relaxed Einstein 
equation. That ^th-order term has the form E^jy[h^] = . . . , Up to 



^ = 3, it reads 

Efiu[h^] =0 X e (21a) 

E^Ah^] = 25^R^Ah\h^] x e (21b) 

E^Ah^]=2S^R,,Ah\h\h^]+4S^R^Ah\h^] x (21c) 

where I have defined the “^th-order Ricci tensor” to be the ^th variation^ 

1 



In the concrete calculations in this paper, I will require only which is given 

explicitly by 

^ Ra(5\h’> ~ 2^^ 5^ (2^/x(q;;/ 3) “f ^z2q;;/x) 

3 “ 4 ^^ 2 ^^ (^^/ x ( q ;;/ 3 ) z 2 haj3',fiv hfiv,al3) • ( 23 ) 

Solving Eqs. (21) subject to the boundary conditions (BC1)-(BC2) yields a 
functional-valued asymptotic solution to the relaxed Einstein equation (19). Now 
we must find a particular function for which e; z) is also an asymp- 

totic solution to the unrelaxed equation (17). To do this, we must ensure h^jj{x, e; z) 
also satisfies the gauge condition (18) to the same order. 



^Cross terms like , h^] are as defined in Eq. (227). 

^^Equation (222) illustrates more explicitly, using a point-particle field, how the gauge condition 
implies an equation of motion. 
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I accomplish this in a systematic way by writing the accelerated equation of motion 
of Ze as 



a^^^ = F^{r,e), (24) 

and then assuming that like h^jy, the force (per unit mass) appearing on the right-hand 
side can be expanded as 



F^(r,e) =;^e"F„^(r;ze,Ze). (25) 

n>0 



I substitute this expansion, together with the one in Eq. (20), into the gauge condi- 
tion (18) and solve order by order in e while holding {ze , Ze) fixed. By holding (z^, Ze) 
hxed during this procedure, rather than expanding their e dependence, I preserve the 
particular accelerated worldline that satishes some appropriate mass-centeredness 
condition, such as the vanishing of a suitably dehned mass dipole moment in the 
buffer region centered on 7 e. Solving the sequence of gauge conditions to higher and 
higher order yields a better and better approximation to the equation of motion of 
that particular worldline, without ever expanding the worldline itself. The hrst few 



of these gauge conditions are 

=0, (26a) 

5L^[h\Fx] = -Li,[h^,F^], (26b) 

5L^[h\ F 2 \ = -L^[h\ Fo] - 5L^[h^, Fi] - 5^Li,[h\ Fi, Fi], (26c) 

where I have dehned L^[/z, F] = L^lh] and 

1 d^ 

5’'L,[h, F, . . . , F] ^ 



This sequence determines the forces F^ thereby determining the equation of 
motion (24). 



1.5.3 Solution Method 

We solve Eqs. (21) and (26) by working outward from the buffer region. Solving them 
in the buffer region using a local expansion, subject to (BC2), yields two things: the 
local form of the metric outside the object, and an equation of motion for the object. 



^Tt also constrains other quantities in h^y, particularly determining the evolution of the object’s 
mass and spin. 
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The local form of the metric is described in Sect. 3. In line with the themes 
of the earlier sections, it allows a natural split into an effectively external metric 
gfiiy = Qiiv + ^ self-field where satisfies all the “nice” 

properties of the Detweiler- Whiting regular field. 

Derivations of the equations of motion at first and second order are sketched in 
Sects. 3 and 7. If at leading order the object’s spin and quadrupole moment both 
vanish, then the equation of motion is [14, 16]. 



1 









+ O(e'), 



(28) 



where = eh^l -h -h O(e^) . Following the steps of Appendix 1 , this equation 

of motion can also be written as = O(e^), the geodesic equation in the effective 
(smooth, vacuum) metric In other words, at least through second order in e, the 
generalized equivalence principle described in Sect. 1.1.2 holds. 

After obtaining the local results in the buffer region, one might think to solve 
Eqs. (21) and (26) globally in <^(e) by imposing agreement with the local results 
on the inner boundary and then moving using the equation of motion. 
However, in practice, a global solution is instead obtained by analytically extending 
the buffer-region results into S^^\ 7 e, replacing the physical metric there with the 
fictitious, analytically extended metric, while insisting that outside B^^, the metric 
is unaltered. This procedure (described in detail in Sect. 4) allows us to work with 
field equations on the whole of D<^(e) = ^ 7 e,<^(e) U B^^. At order e, the procedure 
reveals that \ Ze) in ^^ 7 e,^(e) is precisely equal to the perturbation produced by 
a point mass moving on zf , as promised in Sect. 1.1.1. More generally, at all orders, 
it leads to a practical puncture scheme [15, 52, 54-57], in which the puncture /z^^, 
a local approximation to /z^^, moves on 7 e, and the field equations in B^^ are recast 
as equations for a residual field /z^ that locally approximates h^^}^ 

The setup of a puncture scheme is compactly summarized in Eqs. (98)-(100) 
below. Using this scheme, one can directly solve for the effective metric on the 
worldline and use it to evolve that worldline via the equation of motion, and at the 
same time one can obtain the physical metric outside B ^^ . Although we begin with 
a potentially complicated extended object, this scheme illuminates the fact that in 
self-force theory, we do not need to know anything about the particularities of that 
object: at the end of the day, all necessary physical information about it is absorbed 
into the puncture and the motion of that puncture. 



^^Note that although a puncture scheme utilizes approximations to and it is designed to 
exactly obtain (and any finite number of its derivatives) on the worldline, meaning it does not 
introduce any approximation into the motion of 7^. Nor does it introduce approximations into the 
physical field = h^y + h^y = h'^y + h^y. 
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1.5.4 Accuracy Estimates 

How accurate will this self-consistent approximation be on a domain Let 

us make the reasonable assumption that the largest secularly growing error in the 
approximation arises from truncating the expansion (25) at some order leading 
to an error in of order Sz^ ^ The largest error in is then 

Error in ~ ■ (5z“ = 0(e''+^t^). (29) 

The order of accuracy depends on the size of the domain we work in. Suppose we 
work in corresponding to the radiation-reaction time. On that domain, the 

error from neglecting is * 1/e^) = 0(e^). Therefore, if we include only 

in the equation of motion, solving (via a puncture scheme) the coupled system 
comprising Eqs. (21a) and (9), then the result contains errors 0(e^ • 1/e^) = 0(e ) — 
which is as large as our first-order perturbation. In other words, this approximation 
fails on Pi/e- If in addition we include F^, solving the coupled system comprising 
Eqs. (21a), (21b), and (28), then the error is 0(e^ • 1/e^) = O(e^); hence, with this 
approximation, we can have faith in our field z). 

In the smaller domain corresponding to the so-called dephasing time 

^dph the approximations are more accurate. Including only F^ , by solving 

Eqs. (21a) and (9), yields a first-order-accurate solution with errors 0{e^ • 1/e) = 
O(e^). Including F^, by solving Eqs. (21a), (21b), and (28), yields a second-order- 
accurate solution with errors 0(e^ • 1/e) = O(e^). 



1.5.5 Gralla-Wald Approximation 

I next consider the Gralla-Wald representation of perturbative motion, named after 
the authors of Refs. [18, 20]. In this approximation, as in Eqs. (15) and (16), one 
considers the effect of the self-force to be a small perturbation of the worldline, and 
one expands the worldline as 



n>0 



(30) 



where the terms measure the deviation of z^ from the zeroth-order worldline 
Zq. Substituting this expansion into Eq. (20), one obtains a new expansion of the 
metric perturbation: 

h^i,(x, e; z,z) = zo, ■ ■ ■ , Zn-i)- (31) 

n>t) 



^^Here I return to what will become my common practice of dropping the subscript e on for 
simplicity, though I refer to the self-consistently determined center-of-mass worldline, not the freely 
specifiable worldline for which the relaxed Einstein equation can be solved. 
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Sections describes this expansion in some detail. Explicitly, at first and second 
order, zo) = zo) and zo, zi) = h^^ix; zo) + zo, zi), 

as described below Eq. (16). Since the individual terms Zn(s) are independent of 
e in Eq. (30), Eq. (31) is an ordinary expansion in which the coefficients do 
not depend on e, unlike in Eq. (20). Rather than starting from the self-consistent 
representation and then expanding the worldline, one could instead simply start with 
this ordinary expansion, as was done by Gralla and Wald. In this paper, to explicate 
the relationship between the two, I will instead almost always derive Gralla- Wald 
results from self-consistent results. 

In the Gralla- Wald approximation, the role of the domain is played by 

'D'josie)^ which excludes a small region around 70 (the worldline with coordinates 



Zq). In that region Eqs. (21) become 
E^i/\h ] = 0 V G 

E^u[h^] = 25^ h^] x e X> 7 o.<;(e)’ 

= 26^R.Ah\h\ h^] + 46^RaAh\P] 



(32a) 

(32b) 

(32c) 



The gauge conditions (26) become simply 

= 0 . 

The equation of motion (28) becomes a sequence of equations for z« : 

^4 






dTo 

D^zt 

drl 

D^2F 



= 0 , 



El (to; zo) - R'^a/3'rUQZiu2, 



dr I 



0 



= 7(ro; zo, zi) - (“O^^f “d + 2.u°Jlu^^ 



”T 2/v a(3^\5Zi Uq Zi Uq , 



(33) 

(34) 

(35) 

(36) 



where tq is proper time on 70 as measured in Uq = , and the forces 

and are constructed from according to Eqs. (206)-(207). The Riemann 
terms in these equations of motion are geodesic-deviation terms; they correspond to 
the fact that even in the absence of a force, two neighbouring curves z^ and Zq will 
deviate from one another due to the background curvature. Appendix 1 describes 
how Eqs. (34)-(36) are derived from Eq. (28). As explained more thoroughly there, 
the quantities and Z 2 f vectors that live on 70. Z 2 f is defined by applying 
the expansion (30) in a normal coordinate system centered on 70; it is related to Z2 
in any other coordinate system by the coordinate-dependent relation z^/r = Z2 + 
5r^^p(zo)ziZp 
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Just as in the self-consistent case, we can work with field equations on = 
U Bjq by replacing the physical metric in with the analytical extension of 
the buffer-region metric. Re-expanding the results from the self-consistent case, we 
find that (x ; zo) in is identical to the perturbation sourced by a point parti- 
cle moving on Zq ; the expansion of z) around zo) is derived in Appen- 

dix 2. We also arrive at a substantially simplified practical puncture scheme: rather 
than having to solve for z^ and together, as a coupled system, one can first spec- 
ify a geodesic Zq and then calculate in sequence (i) the perturbations zo) and 

h^l(x; zo), (ii) the deviation z^ driven by (iii) the perturbations h^^jy{x; zo,zi) 
and h^^(x; zo, Zi), (iv) the deviation Z2 , and so on. At all orders, the puncture moves 

on Zq ; the deviations z^>q, through their appearance in merely alter the singu- 
larity structure of the puncture. This puncture scheme is compactly summarized in 
Eqs. (117H119) below. 

On what domain is the Gralla-Wald approximation valid? There is no obvious 
estimate for the rate of growth of the terms z^>^, but as in Sect. 1.4, we can easily 
estimate the growth of ez^ to be of order et^}^ If we assume that the dominant error 
in arises from the z^ term in then we have 

Error in ~ = 0(e^t^). (37) 

If this estimate is valid, we can make the approximation valid to any order on domains 
^70, ^(e) with sufficiently small <;(e) tdph = Vv^- On domains comparable to the 

dephasing time, v^’ deviation vector ez^ becomes of order 1, the second- 
order metric perturbation becomes as large as the first, and the expansion of the 
worldline ceases to be sensible. If higher-order deviations z(^> ^ grow large much more 
quickly than Zp as we might surmise from Eq. (36), then the domain of validity of the 
expansion may be substantially smaller than even for first-order accuracy; 

if at each higher order the deviations grow more rapidly than the last, it may be the 
case that the domain of validity cannot be extended beyond • 



1.5.6 Osculating Geodesics 

Einally, I consider an approximation intermediate between the self-consistent and 
Gralla-Wald expansions, one which makes use of both the expanded and unexpanded 
representations of the worldline. Starting from the self-consistent representation, 
the idea is at each instant r on 7, to perform a Gralla-Wald expansion in a region 
^7o(r),^(e)’ where is a geodesic of that is instantaneously tangential to 7 at 
time r; 707) is called an osculating geodesic [59].^^ By solving the field equations 



^^See Ref. [58] for an explicit solution to Eq. (35) in a particular scenario. 

^^The scheme I describe here should not be confused with the general method of osculating geo- 
desics, which is simply a way of using instantaneously tangential geodesics to rewrite an equation 
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of the Gralla-Wald approximation, one may calculate the self-force at time r, and 
then use that force to evolve to the next time step. By following this procedure 
at each time step, one eventually obtains 7 over the entire timespan of interest. The 
terms in the self-consistent approximation z) = 2 e 

be found simply by solving Eqs. (21a)-(21c) (and higher-order analogues) with 7 
already pre-determined. 

More concretely, at each instant r on the worldline, one perform a Gralla-Wald 
expansion z^(r', e) = ^ where r is the specific instant of interest and r' 

is variable, and one substitutes this expansion into the right-hand side of Eq. (25) — 
but not into the left-hand side — to obtain 



dr^ 



= eFf (r; zo(r)) + zo(r), Zl(r)) + 



(38) 



The forces on the right-hand side are constructed from fields h^l(x; zo(r)) and 
h^^(x; zo(r),Zi(r)) (and higher-order fields for higher order forces) according to 
Eqs. (206) and (207). Eor example, in terms of the tail of the perturbation, (r; zo(r) ) 

is given by the order-e term on the right-hand side of Eq. ( 6 ) but with the tail 
integral (7) evaluated over Zq^^^ rather than over These fields are found by 
solving the following sequence of equations at each value of r: (34) for the 
osculating geodesic z^^^y (32a) for hj^j,(x; zo{r)) in (35) for z^^y and 

(32b) for zo(r), zi(r)) in This suffices to compute Ff(r; zo(r)) 

and ^^(t; zo(t). Ziy)), but in principle the calculations could proceed to higher 
order. All of these equations are to be solved subject to the “osculation conditions” 
Zq7) = Z^ij), u^^yir) = w^(r), z(^>o(r) = 

that 7 o(r) is tangential to 7 at time r. They must also be solved subject to boundary 
conditions that ensure the sum Xn>o ^O(r)) agrees with the full field 

in at nonlinear orders, finding those boundary conditions may be highly 

nontrivial. 

At linear order, the osculating-geodesic approximation has already been con- 
cretely implemented to find z^ using the force F^(r; zo(r)) [60-62] (building on 
the framework in Ref. [59]) and to compute hj^^(x; z) [61, 62]. However, to my 
knowledge, the brief sketch above is the first time it has been described at nonlin- 
ear orders (although possibly equivalent ideas have been presented by Mino [63]). 
Considerably more work must be done to establish that the scheme is viable beyond 
linear order. If it is, then one may naively estimate that it is valid to the same order 
on the same domain as the self-consistent expansion: including only F^^ (r; zo(r)) in 



(Footnote 15 continued) 

of motion in terms of more convenient variables; that general method, inherited from 

celestial mechanics, is exact and does not inherently involve an expansion of z^, although it is 
particularly well suited to the osculating-geodesic approximation discussed here [59]. 
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Eq. (38) yields an approximation valid up to 0(e^) errors in a domain and 

including both zo(r)) and F^^r; zo(r)^ ^i(r)) yields an approximation valid 

up to 0(e^) errors in a domain 



1.6 Outline of This Paper 

As discussed above, the bulk of this paper focuses on the formalism presented in 
Ref. [11] and further developed in the series of papers [12, 14-18, 52]. 

I begin in Sect. 2 with a more complete description of matched asymptotic expan- 
sions, the concepts of which underly most of self-force theory. I focus on the impor- 
tance of the buffer region and formulating appropriate definitions of the small object’s 
representative worldline. 

Sections 3 and 4 present an algorithm for constructing an ^th-order expansion in 
the self-consistent approximation. In Sect. 3, I describe how to obtain the general 
solution in the buffer region. This algorithm determines the equation of motion as 
well as a natural split of the general solution into a self-field and an effectively 
external field. In Sect. 4, I describe how to generate the global solution in <^(e), 
using as input the self-field obtained in the buffer region. Along the way, I show how 
at linear order, the point-particle picture is recovered, and how at nonlinear orders, a 
certain point-particle “skeleton” can be defined to characterize the object’s multipole 
structure. 

In Sect. 5, I show how to recover a Gralla-Wald expansion or an osculating- 
geodesics expansion from the results of the self-consistent expansion. 

In Sect. 6, 1 discuss the gauge freedom in each of the three types of expansions — 
self-consistent, Gralla-Wald, and osculating-geodesics — and how that freedom 
relates to the representation of perturbative motion in each. 

In Sect. 7, 1 sketch how one can derive equations of motion by obtaining further 
information about the inner expansion and then utilizing the relationship between 
gauge and motion. This is an alternative to the algorithmic approach in the buffer 
region, and it is the method that has been used in practice to derive second-order 
equations of motion [14, 16, 20]. 

I conclude in Sect. 8 with a summary and a discussion of future directions. 

The appendices contain more general results: expansions of the geodesic equation 
in a perturbed spacetime, expansions of point particle functionals of an accelerated 
worldline around a geodesic, and some identities pertaining to gauge transformations 
of curvature tensors. 
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2 Matched Asymptotic Expansions 
2.1 Outer, Inner, and Buffer Expansions 

Traditionally in applied mathematics, the method of matched asymptotic expansions 
has been used to find both inner and outer expansions to some desired degree of 
accuracy and then combine them to obtain a uniformly accurate solution in the entire 
domain. But for my purposes here, I will not be interested in obtaining an accurate 
solution in the region r ~ e; instead, I will be interested in the inner expansion only 
insofar as it constrains the outer expansion. Finding an accurate inner expansion 
would require specifying the type of compact object we are examining, while here 
I am interested in generic results that apply for any compact object. As it turns 
out, finding those generic results requires only minimal knowledge of the inner 
expansion — in fact, just its general form in the buffer region suffices. 

To build toward that conclusion, let me describe the formalism more geometrically. 
I focus for the moment on the self-consistent case. In the outer expansion, we wish 
to approximate the exact spacetime (g^^, Me) in the domain outside a small 

region of size r around the object. I expand the exact spacetime around a 

background {Qhu, Mq) by adopting some identification ip : C A^o ^ 

Me between the spacetimes (in the region outside the inner region). In a given 
coordinate system : Mq the identification map assigns points p G A^o 

and pip) G Me the same coordinate values x^{p).\ next write as a functional of 
a worldline 7 c Mq as g^^^ix, e; z) in those coordinates, where z^is) = x^i^is)), 
and I then expand for small e while holding both x^ and z^ fixed. I thence arrive at 
Eq. (20).^^ Note that z^ is defined in the background, not in the perturbed spacetime; 
the identification map is assumed to exist only in 22 ^, ^(e), not in the inner region 
where there may lie a black hole rather than an identifiable worldline, for example. 
Later, I will replace the physical metric in the interior of the object with the effective 
metric , and via the identification map, the worldline will have identical coordinate 
values in the effective spacetime ig^jy, 7 W) as in the background. 

Now, the inner expansion is constructed by choosing some coordinates (t,x^) 
centered on 7, and then rescaling spatial distances according to x^ = x^ je. The inner 
expansion is performed by expanding for e ^ 0 while holding the scaled coordinates 
(t,x^) fixed, as in^^ 



^^In truth, it is unlikely that any of the expansions I consider, whether self-consistent, Gralla-Wald, or 
osculating-geodesics, is convergent. More likely, they are asymptotic approximations only. So when 
performing the self-consistent expansion, I actually assume that |g^j^(x, e) — g^^ix, e; z) | = 

where g^j^(x, e; z) = Qpvix) + X^=i ^)- The notation o(k(e)) means “goes to zero faster 

than k(e)”. 

^^Here indices refer to the unsealed coordinates If components are written in the scaled 

coordinates, overall factors e and appear in front of ta and ab components, respectively. These 
overall factors have no practical impact. 
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e) = x^) + x«; z). (39) 

n>\ 



Tensors in this expansion live on a manifold A^obj , where , A^obj) is identified as 
the object’s spacetime were it isolated. The perturbations x^\ z) describe the 

effect of interaction with the external spacetime. What is the meaning of this expan- 
sion? The scaled coordinates serve to keep distances fixed relative to the object’s 
mass in the limit e 1, effectively zooming in on the object by sending all distances 
much larger than the mass off toward infinity. The use of a single scaling factor makes 
the approximation most appropriate for compact objects, whose linear dimension is 
comparable to their mass. Scaling only distances, not A is equivalent to assuming the 
object possesses no fast internal dynamics; that is, there is no evolution on the short 
timescale of the object’s mass and size. 

Note that the treatment of the “region around the object” depends strongly 
on whether one considers a self-consistent or Gralla-Wald expansion. In the self- 
consistent case, the outer expansion takes the limit as the object shrinks toward zero 
size around the self-consistently determined, accelerated worldline 7 C Af 0, and the 
inner expansion blows up a region around that accelerated worldline; in the Gralla- 
Wald case, the outer expansion takes the limit as the object shrinks to zero size around 
the zeroth-order, background geodesic 70, and the inner expansion blows up a region 
around that background geodesic. 

In either case, the relationship between the two expansions is illustrated in Fig. 1. 
I now use this relationship to feed information from the inner expansion out to the 
outer expansion. This exchange of information is done in the buffer region around 
the object. From the perspective of the inner expansion, the buffer region lies at 
asymptotic infinity in Afobj- In that region, the inner expansion can be expressed in 
unsealed coordinates as 



x°/e) + x"/e\ z) (40) 

n>\ 



and then re-expanded for small e (or equivalently, expanded for r ^ e; i.e., for 
distances that are large on the scale of the inner expansion). Conversely, from the 
perspective of the outer expansion, the buffer region lies in a tiny region around the 
worldline. Hence, in that region the outer expansion can be expanded for r 
(i.e., for distances that are small on the scale of the outer expansion). Since the inner 
and outer expansions are assumed to approximate the same metric, it is assumed that 
the results of these re-expansions in the buffer region must match order by order in 
both r and 



^^This matching condition amounts to the assumption that nothing too “funny” happens in the buffer 
region. It can instead be replaced by more explicit assumptions on the behavior of the full metric 
g^j^, such as the conditions assumed in Ref. [18] or various others discussed in Ref. [41]. 
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Now, say the ^th-order outer perturbation is expanded as x^) = 

(t,n^) in the buffer region, where /r}^ One must allow negative powers of 

r, since at least part of the field will fall off with distance from the body. But there is 
a bound on the most negative power at a given order in e: Since the inner expansion is 
assumed to be well behaved, it must include no negative powers of e. And since any 
term in must coprrespond to a term in the inner expansion, if is written 
as a function of the scaled distance r = r/e, it must likewise have no negative powers 
of e. From this it follows that 

x') = «') + 0(e"r-”+i). (41) 

Any higher power of 1 / r would induce illegal powers of e; for example, . 

We can go one step further with this general analysis. Since /r^ is independent 
of e in the scaled coordinates, and is the only term in the inner expansion (39) 
that does not depend on e, it must be that is equal to a term in . If we write 

in terms of the unsealed coordinates and expand for r ^ e, we find its 
form in the buffer region is^^ 

= (42) 

n>0 



From the matching condition, we then have 



h 



n,—n 

ijLiy 



= g 



obJ,n 

fiiy 



(43) 



Therefore, at each order in e, the most singular (as a function of r) piece of the 
metric perturbation /z”^ in the buffer region is determined by the r ^ e asymptotic 
behavior of the object’s unperturbed metric. In addition, note that because quantities 
in the inner expansion vary slowly in time relative to their variation in space, they 
are quasistationary: after changing to scaled coordinates, a derivative with respect 
to t effectively increases the power of e relative to a derivative with respect to 
Hence, on any short time, we can think of g^J being stationary, and we can write 
its asymptotic form (42) in terms of a canonical set of multipole moments [64, 65]. 
So our final statement is that the inner expansion constrains the outer expansion to 



r terms also generically arise. For simplicity, I incorporate those terms into for the moment. 
Their presence does not spoil the well-orderedness of the expansion, since (In r)^ r^' (In r)^' 

for p > p' . Similarly, Ine terms can occur in solving the relaxed Einstein equation [15], and I 
absorb them into the coefficients z). 

^®The fact that the inner background must be asymptotically fiat, containing no positive powers of 
r, follows from the assumption that the outer expansion contains no negative powers of e, in the 
same manner as the cutoff on powers of 1/r in Eq. (41). 
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have the form (41), and the coefficients ^ can be expressed in terms of multipole 

moments of the small object’s unperturbed spacetime , Afobj)- This is all the 
information that will be required from the inner expansion (until we get to Sect. 7). 



2.2 Defining the Worldline 



Let us now use the above formalism to define what we mean by the object’s world- 
line. First consider the self-consistent case. In the coordinates (t,x^) centered on 7 , 
calculate the mass dipole moment of the spacetime . A mass dipole moment 

indicates the position of the center of mass relative to the origin of the coordinates. 
If the coordinates are mass -centered, a Coulomb-like piece of the field behaves as 
m/r; if the origin lies slightly away from the center of mass, by an amount , then 
the Coulomb-like field behaves as m/|v^ ~ - Expanding this around = 0, we 

find m/r -h m^in^ -h 0(|^p). The quantity m^^ is the mass dipole moment ML 
Ergo, if this mass dipole moment vanishes in the coordinates centered on 7 , then 
the object is appropriately centered “on” 7 , and we identify 7 as a good representa- 
tive worldline.^^ Because the definition only utilizes quantities in the buffer region, 
this definition makes sense even if the object is a black hole or contains topological 
oddities such as a wormhole: even if there exists no identification map between the 
background and the exact spacetime in the region inside the object, at least in the 
buffer region the coordinates (t,x^) can be used to chart the manifolds Me and Moh ] , 

and in those coordinates the metrics g^^ and g^^^j have reference to the worldline 7 

that is defined only in Mq. I refer the reader again to Fig. 1 to illuminate this. 

Now consider the Gralla-Wald case. Here 70 is the worldline around which the 
inner expansion is performed. In a generic gauge (in particular, in the Lorenz gauge I 
work in), the small object’s center of mass deviates from this worldline, as assumed 
in the expansion (30). Gauges in which the object does not deviate from 70 (on short 
timescales) are discussed in Sect. 7, but in a generic gauge, clearly we do not have 
= 0 in coordinates centered on 70 . Instead, we calculate in those coordinates 
and then define the first-order correction to the motion to be 

z\ = /m. (44) 

Figure 2 illustrates the difference between this setup and the self-consistent one. 



^^This notion of mass-centeredness based on the mass dipole moment of applies only to order-e 

deviations from For higher-order deviations, mass-dipole-moment terms in the perturbations 
must also be considered, or some other copacetic centeredness condition must be imposed, as 
discussed in Sects. 3 and 7. 
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3 Algorithm for an nth-Order Self-Consistent 
Approximation: General Solution in the Buffer Region 

We are now positioned to actually obtain an outer expansion. In this section, I present 
an algorithm for finding the outer expansion in the buffer region. In Sect. 4, 1 describe 
an algorithm for obtaining a global solution using as input the results from the buffer 
region. In both cases, I specialize to the self-consistent case. 

My method of finding the general solution in the buffer region is modeled on 
the post-Minkowskian methods of Blanchet and Damour [ 66 ] . Like them, I write 
the general solution in terms of a set of algorithmic multipole moments, which are 
defined simply from the algorithm of solving the relaxed Einstein equation, but a 
subset of which can be concretely identified with the physical multipole moments 
of . All the moments are completely unconstrained so long as we solve only the 
relaxed Einstein equations (21). Once we impose the gauge condition, as in Eqs. (26), 
the moments become constrained, and in particular, evolution equations arise for the 
mass monopole, mass dipole, and spin dipole moments. The equation for the mass 
dipole moment will be used to identify the center-of-mass worldline 7 for which 
= 0 . 

The solution thus obtained will be a general solution in the sense that it is made 
to satisfy the matching condition (BC2) but not any particular global boundary con- 
ditions (BCl). (Refer back to the opening portion of Sect. 1.5.) Because the explicit 
calculations, as well as their results, are exceedingly lengthy, I merely sketch the 
algorithm and the form of the results. I refer the reader to Refs. [9, 11, 15, 52] for 
more detailed expositions. 



3.1 Setup 

Eor concreteness, I work in Eermi- Walker coordinates (t,x^) centered on 7, in which 
t is proper time on 7, r = ^Sijx^xJ is the proper distance from 7 along a spatial 
geodesic (3 that intersects 7 perpendicularly, is a unit radial vector that labels 



the direction along which (3 is sent out, and the spatial coordinate is x^ = rn\ 
Reference [9] contains a pedagogical introduction. 

The background metric in these coordinates is given by 

9tt = -(1 + aix'f - Roiojx'xj + O(r^), (45a) 

gta = -iRoiajx'xi + 0 (^ 3 )^ ( 45 ^) 

9ab — ^ab ^ RaibjX X'^ + 0{r ), (45c) 



where the Riemann terms are evaluated on the worldline and contracted with members 
of a tetrad (u^, e^) on 7 that satisfies ^ = e^ for any coordinates y^. Eor example. 
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^0iaj{t) = Raii( 3 iyiz{t))u^ ef eae^j . An overdot will indicate a covariant derivative 

along the worldline, such as R^iaj = . Relating back to the 

discussion below Eqs. (13) and (14), the Riemann tensor on 7 can be written in terms 
of two tidal moments: 



^ab = ROaOb') ^ab = i^R^)^Pq ^ 



(46) 



and Rabcd — ^ac^bd ^bd^ac ^ad^bc ^bc^ad- 

Reference [15] displays the metric (45) to higher order in r. At all orders in r, 
all terms are made from the acceleration, Riemann tensor, and derivatives of the 
Riemann tensor. Because the coordinates are tethered to an e-dependent worldline, 
the background picks up an e dependence, seen most obviously in the accelera- 
tion terms . However, this is purely a coordinate effect. In any global coordinates 
covering is e-independent by definition. In a Gralla-Wald expansion, 

the e-dependent coordinate transformation to Fermi-Walker coordinates would be 
expanded in powers of e, splitting it into an e-independent coordinate transformation 
(to Fermi coordinates centered on 70 ) plus a gauge transformation. Because I wish 
to at no point expand I do not expand the transformation in this way. 

In these coordinates, I expand as in Eq. (41). I further assume that the coeffi- 
cients in that expansion are smooth, and I make logarithms explicit; these logarithms 
appear generically in solutions to inhomogeneous hyperbolic equations. We then 
have an expansion of the form 



-^max ^max 
p>-n i=0 q=0 



where L = /i • • • is a multi-index, is a smooth function of t, and = 

... is an STF combination of unit vectors. The decomposition in terms of 
is equivalent to an expansion in spherical harmonics, and like a spherical harmonic, 
is an eigenfunction of the fiat-space Laplacian, satisfying d^dih^ = — 
References [ 66 , 68 ] contain excellent introductions to this type of decomposition, 
along with many useful identities. Favoring over angles (0, 0), and fi^ over spher- 
ical harmonics, is useful because gf^i^ is naturally written, as above, in terms of 



^^Intuitively, the logarithms are caused by the object perturbing the spacetime’s light cones. One 
can expect the solution to the exact Einstein equation to propagate on (and within) null cones of the 
exact spacetime, and given that the mass of the body induces a logarithmic correction to the retarded 
time, logarithmic corrections then naturally appear in This effect is well known from solutions 
to the Einstein equation in harmonic coordinates (see, e.g.. Refs. [66, 67]). Eor generality, I allow 
logarithms at any value of n, but I assume that for each finite n, p, and I, the highest power of In r 
is a finite number qmax(n, p, f). Eor simplicity, to make sure that term-by-term differentiation is 
valid without worrying about issues of convergence, I also assume for a given, finite n and p, I has 
a maximum £max (n, p). 
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3.2 Seed Solutions 



By definition, is the solution to the ^th-order relaxed Einstein equation, which 
has the form E^jy[h^] = , . . . , Before considering the general solution 

to this equation, let us first consider solutions to the corresponding homogeneous 
equation = 0. We shall find that every solution of the form (47) is a member 

of a set z, I^) : 0 < I < n} or of a set z, k^) : ^ > 0}, or it is a 

superposition of such members. For a given background metric, the seed solutions 
z, I i) and iht free solutions z, k^) are completely determined by 

the worldline z^ and functions of time I^j^iit) or k^^j^{t) along that worldline. The 
quantities will be identified as algorithmic multipole moments that describe 
the object’s multipole structure, while each of the quantities k^^^^ will identify an 
effectively external, freely propagating perturbation. 

We arrive at these conclusions simply by substituting the expansion (47) into 
EiJiAh^] = 0- Fii'st note that spatial derivatives reduce the order in r by one power, 
while temporal derivatives do not affect the order in r. This means we can write 
Ef^iAh"^] = where W^Ah"^] ~ Solving for order 

by order in r is hence reduced to solving a sequence of Poisson equations = 

— Substituting the expansion (47) into this, we find that the coefficient of 

r^“^(ln r)^h^ in the equation is 



[p(p + 1) - m + + (q + l)(2p + + (q + l)(q + 



p-l Irr 



^max 



Z Z Z W,frP'(lnr/h^'hfp'] 

p'=—n i'=0 q'=0 



p—2,q,l 



(48) 



where {-}P^^ means “pick off the coefficient of r^~^(ln r)^h^'\ The important thing 
to note is that the right-hand side depends only on coefficients with p' < p. So 
start with the term with the lowest power of r, such that the right-hand 

side of Eq. (48) vanishes. Further, start with q = ^max, and suppose ^max > 0- 
By assumption, gc-n,qr^^^+\,l gQ gq becomes 

— = 0. This has a nontrivial solution only iff = n — 

Now look at the equation for the same I and p but for q = ^max — 1. It reads 
[n{n - 1) - l{l + ^ = 0. But the first 

term vanishes because f = ^ — 1. Hence, ^ = 0 if ^max > 0, and we 

conclude that h^ ~^ / r^ is the only nontrivial solution to the 1 / r^^'^ term 

in the homogeneous equation E^Ah^] = 0. 1 define the algorithmic moment 



jn _ rn,-n,0,n-l 



(49) 



Now proceed to sequentially higher orders in r, and at every order, set to zero all 
free functions that arise. The result is a solution to E^Ah^] = 0 of the form 
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+ 0(l/r"-‘) 



(50) 



in which every single term is linear in (and ^-derivatives of it), specifi- 
cally consisting of (0 (and its derivatives) contracted with the other available 

Cartesian tensors n\ a\ pieces of the background Riemann tensor, and 

pieces of derivatives of the background Riemann tensor. 

Now set to zero and solve the 1 / and higher-order terms in Eq. (48), 

following exactly the same steps as above. Again the result is a homogeneous solution 
that is completely determined by a single function, in this case ^ ^ • 

Now keep doing the same by beginning at each following order up to (and including) 
1/r^ , always setting every free function to zero except the first one encountered. This 
procedure completely populates the set z, I^) : 0 < I < n] with solutions 

of the form 



z, n) = ( 51 ) 

where every term in the solution is proportional to (t) or its 

derivatives. The algorithmic multipole moments are symmetric in their first two 
indices and STF in their last I indices. 

Continuing the same procedure to higher orders in r, we completely populate the 
other set, z, 7^) : 7 > 0}, with solutions 

z, k1) = + 0(/+^), (52) 

where every term in the solution is proportional to or its 

derivatives. The change in behavior from to arises because the factor 

[p(p + 1) — 7(7 -h 1)] in Eq. (48) vanishes for p = — 7 — 1 if p < 0 and for p = 7 
if p > 0. As with 7"^^, the quantities are symmetric in their first two indices 
and STE in their last 7 indices. 

It is easy to see from the above steps that the set of seed solutions z, I 

and free solutions z,k^) form a complete basis of solutions to = 0 

of the form (47). If extended down to r = 0, each seed solution diverges at r = 0, 
while each free solution is smooth there. 

Eor later purposes, it will be convenient to split each algorithmic multipole moment 
into a mass and current moment, 

= + (53) 

where the mass moment is the even-parity part of 7^^^, satisfying = 

current moment is the odd-parity part, satisfying = 
splits each seed solution into two, z, M^) and 
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3.3 General Solution 



Now turn to the inhomogeneous equation Start at the 

n = I equation, (21a). The source vanishes, and the general solution is made up of 
a single seed solution. 



Z, d) = + O(r0), (54) 

plus the sum of all free solutions z,k^). That is, 

hla, = d) + Z, k}). (55) 

f>0 

Next move to the n = 2 equation, (21b). The l/r^+^ term in the equation looks 
essentially the same as Eq. (48), except that the right-hand side contains a term 
coming from , h^]. One can straightforwardly solve for the functions to 

find that each one of them is either the starting point for a new seed or free solution 
^ ^ or for f > 0) or it is directly proportional to a pql mode 

of the source. Since the source is constructed from quadratic combinations of 
it follows that the general solution is made up of the two seed solutions 

1^ .(tW 

Z, /f) = + 0{\/r), (56) 

/2 (t) 

{x; z, Iq) = — h U{r ), (57) 

plus the sum of all free solutions z, kf), plus a particular inhomogeneous 

solution made up entirely of terms that are quadratic combinations of members of 
the set {/^^, : I > 0} (in other words, a particular solution in which all seed 

solutions and free solutions are set to zero). That is, 

hi. = h^;^^(x-, z, ll) + z, I^) + z. kj) 

l>0 

+ hli^{x;z,I^Aklh>o). (58) 

The promised logarithms of r first appear in h^J/(x;z,I^Akl}e>o)[n, 15], 
Carrying this procedure to ^th order, we find 



n—l 



‘'fiv 



Y^h^^{x-,z,I1) + Y.hl^{x-,z,kl) 



1=0 



l>0 



+ hll^ix; Z, {I^U<n,i<n, {kT}m<n,t>0)^ 



( 59 ) 
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where all functions appearing in are made up of explicit products of 

members of the set m < n, I < n — 1} U : m < n, I > 0} (or 

their t derivatives) contracted with combinations of n\ a\ pieces of the 

background Riemann tensor, and pieces of derivatives of the background Riemann 
tensor. 

In any particular solution of the form (47) to the relaxed Einstein equation outside 
a small object, every term, to all orders in r and e, is a linear or nonlinear combination 
of the functions I"j,i^{t) = {t) and k"^^i{t) = Specifying all 

the functions and k^^^(t) corresponds to making a specific choice of small 

object and of global boundary conditions. 



3.4 Imposing Matching and Gauge Conditions 

To satisfy the unrelaxed Einstein equation, the general solution described above must 
also satisfy the gauge conditions (26), which we can write generically as 

Fo] = (60) 

for some that is linear in the /z’s but not usually in the E’s. Recasting this as a 
constraint on the seed and free solutions in terms of lower-order fields, we have 



n—\ 

Fo] + Fo\ 

1=0 £>0 

= (61) 

Like the relaxed Einstein equation, Eq. (61) can be solved order by order in r , thereby 
constraining the functions and , which in the general solution to the relaxed 
Einstein equation were entirely arbitrary. Working through sequential orders in r first 
constrains each in sequence, from f = ^ — ltof = 0, and then each in 
sequence from £ = 0 to f = oo. The most important outcome is evolution equations 
for the moments 

Ain = 1 , the term with the lowest power of r in Eq. (26a), 1 / , is 5^ “ 

= 0(l/r), which determines = 2m (0^/xz^. The overall factor 
is written as 2m{t) for good reason: from Eq. (42), the inner background metric has 

an expansion H- 0(l/r^), and substituting that expansion into the formula 

for the Arnowitt-Deser-Misner (ADM) mass, one finds that m{t) is precisely the 
ADM mass of . We call this the leading-order mass of the object. Proceeding to 
the next order in r, 1/r, time derivatives and acceleration terms appear, and we find 
our sought evolution equations 
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- = 0, r- = 0, (62) 

which tell us that the object behaves approximately as a test body in The remain- 

der of the content of the n = 1 gauge condition, at all orders in r , is to enforce various 
uninteresting relationships between the functions . 

At n = 2, the lowest order in r in the gauge condition (26b), 1/r^, similarly 
determines (after a very slight gauge refinement that remains within the Lorenz 
gauge [11]) that divides into mass and current moments of the form 

= 2Mi V (r) (63) 



where etij = 0. M/ and are, respectively, the mass dipole moment and ADM 

angular momentum of we call these the object’s leading-order mass dipole 

moment relative to 7 and leading-order spin about 7. Defining 5m^y{t) = 
we also find (from the order- 1/r^ and 1/r terms in the gauge condition) that 



5ma[} = \m ^ 2^7 + 9a(}9'^'' 

+ 4u(^a(mkp)^u'^ + 2Mp)), (64) 

where = 7^- ^^a/3 can be thought of loosely as a correction to the object’s 
monopole moment, but it is not invariant; a smooth gauge perturbation that 

is nonvanishing on 7 will add directly to k^p. This is why the moments are called 
algorithmic: they arise in the algorithm of solving the field equations, but they are 
not invariant descriptions of the perturbed object. Furthermore, 6m is determined 
only up to a constant c6ag, but I choose to incorporate that term into 

Finally, the 1/r^ term in the n = 2 gauge condition determines the evolution 
equation ^ = 0, which tells us that the leading-order spin is parallelly propagated 
along 7, and the 1/r term determines 



d^Ma 

dt^ 



d 

-RaOboM^’ - niF^ + \R0aijS'K 



(65) 



where I have defined I call this the master equation of motion. It 

describes how the small object moves relative to any choice of nearby worldline 7. 
That motion is influenced by (i) geodesic deviation (evinced by the first term on the 
right), (ii) the acceleration of 7 (manifesting as F^), (iii) the ambient free fields in 
the neighbourhood, and (iv) the Mathisson-Papapetrou force created by coupling of 
the spin to the curvature of the external background metric. In the self-consistent 
expansion, we choose 7 such that = 0. Therefore, 
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This is the MiSaTaQuWa force (albeit in an atypical form) plus the Mathisson- 
Papapetrou force. We see more clearly now that the practical consequence of the 
self-consistent approach, by utilizing a worldline relative to which vanishes, is to 
prevent the term Min!' from appearing in the second-order field (58). According 
to Eq. (65), this term would otherwise grow quadratically with time; this is the same 
undesirable growth discussed in Sect. 1.4. 

As at ^ = 1, at ^ = 2 the gauge condition imposes relationships between the 
functions these relationships do not involve any of the algorithmic moments, 
and they leave each written in terms of arbitrary functions of time. 

Extrapolating to arbitrary order n, we draw several conclusions: (i) The moment 
appearing in the leading order term !r^ is directly identified with a 

moment of . Explicitly, as the mass appears at order e and the spin at order e^, 
so the quadrupole moment will appear at order e^, the octupole at order e^, etc.^^ 
We interpret these moments as the leading-order moments of the object, 

(ii) The others, with f < n — 1, we interpret as perturbations to the object’s 
moments, and the gauge condition determines that each of these corrections involve 
some linear or nonlinear factors of moments of . (iii) Even after the gauge and 
matching conditions are imposed, every coefficient (0 appearing in the free 
fields is made up entirely of arbitrary functions of time that are determined only once 
global boundary conditions (BCl) are also imposed.^"^ 

When going to higher order, it is not obvious what condition should be imposed 
to make the worldline a good representation of the object’s bulk motion. A natural 
candidate would be to set the corrections to the object’s mass dipole moment 
to zero. However, one would have to investigate the third- and higher-order gauge 
conditions to see whether (i) doing so is possible and (ii) doing so prevents unwanted 
secular growth that can be associated with a growing displacement of the object 
away from the chosen representative worldline. Section 7 offers a different method 
of choosing a good worldline, at least at second order. 



^^To relate back to the Thorne and Hartle results (13) and (14), canonical mass and current quadru- 
pole moments Qtj and Qtj can be defined from and S^^-j according to 

~ j ^ (67) 

similarly to Eq. (63). In choosing the normalization factors of these moments, I follow Ref. [68]. 

proof of statement (iii) will be presented elsewhere. Here it can be taken as a conjecture, although 
it is known to be true at all orders in e and r that have been explicitly considered. Intuitively, it 
can be inferred from the fact that we can choose boundary conditions for which all vanish 
at a given n, and with that choice we must still be able to satisfy the gauge condition; hence, the 
constraints on the cannot involve the k^^^'s of the same n. 
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3.5 Split into Self- Field and Effective Field 



I now define a convenient split of into a self-field and an effectively external 
field Specifically, I define to be the piece of Eq. (59) that contains no linear 
or nonlinear combinations of the algorithmic moments : 

^") + Z, {k'^}m<n,l>o), ( 68 ) 

£>0 



where is the part of containing no factors of 7^^^ . This implicitly defines 

the self-field h^l = to be 



‘'fiiy 



n — l 



Z, q) + Z. {I^U<n,e<n, {kTU<„,l>o), 



(69) 



where every term in contains at least one factor of at least one algorithmic 

moment. I also define and Conclusion (iii) at 

the end of the last section implies that each of these two fields separately satisfies 
the Lorenz gauge condition.^^ 

More explicitly, at first order, -h h^l, with 

hli = h^^^ix; z, m) = + 0{r^), (70) 

+ 0{r^). (71) 

£>0 



Recall that for any particular solution of the form (47), the functions in the free fields 



are 



= h 



ijiuL 



At second order, h 



jiv 



with 



6') + h^^f{x\ z, 5m) + h^^J^ix; z, m, {kl}t>o), (72) 

hll = z, kj) + hfj^ix-, z, {k\h>o), (73) 

£>0 

where 



h^^ix- z, S) 



AS^n' 

— + 0(l/r), 



(74) 



the statements were not true, one could always slightly alter the singular-regular split to make 
the two fields independently satisfy the gauge condition. Doing so would involve appropriately 
moving part of the free fields into [15]. 
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z, 5m) = ^ + 0{r^), (75) 

(76) 

h^^J^{x\ z,m, {kl}i>o) is made up exclusively of terms quadratic in m or products 
of m and h^l (and derivatives of h^l), and z, {^|}£>o) is made up of terms 

quadratic in h^l (and derivatives of h^l). 

I remind the reader that in Sect. 1.1.2 I identified three “nice” properties I wish 
an effectively external metric to satisfy: 

1 . it should be a vacuum solution 

2. it should be causal on the worldline 

3. the object should move as a test body in it. 

There is no guarantee a priori that these conditions can simultaneously be satisfied 
at all orders in e, but with the definitions given above, the first nice property is met: 
the effective metric is a smooth solution to the vacuum Einstein 

equation = 0 even at r = 0. This can be seen from the facts that (i) by 

construction, for r > 0 we still have a solution even if we set all moments 
to zero (i.e., if there is no object in S^), and (ii) for the free fields the construction 
applies even at r = 0. Note that here I am taking a particular series (68) obtained 
outside the object and simply analytically extending it down to r = 0. In this way, I 
define an effective spacetime (§^^, M) in where the object lies in the physical 
spacetime. 

More explicitly, the first- and second-order regular fields satisfy the vacuum equa- 
tions 



E^Ah^^] = 0, L^[h^\Fo] = 0, ill) 

= 25^R^Ah^\ LAh^\ Fq] = -SLAh^\ Fi], (78) 

for all r > 0. The self-fields are left to satisfy 

E^Ah^^]=0, LAh^\Fo]=0, (79) 

E^Ah^^] = 2S^Ri,Ah^\h^^]+4S^R^Ah^\h^^], (80) 

LAh^^,Fo] = -5LAh^\FA (81) 



The third “nice” condition is also met, at least through order e: combining Eqs. (66) 
and (71), we get, in covariant form, 

^ = -ip>^A2hf^.^ - hf^.AuPu^ + + 0{€^), (82) 
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where I have defined the antisymmetric 4-tensor Together with 

^ = 0, Eq. (82) is the equation of motion of a test body in It can be put in the 
form of the Thorne-Hartle equations (13) and (14) by using the results of Appendix 
1 to absorb the h^l terms into the covariant derivative and proper time in using 
= mu^ + 0{e^) and the fact that m is constant, and using Eq. (46) to write the 
Riemann tensor in terms of the tidal field Bfj (noting 93/^ = Bfj -h 0(e)). Doing 
so allows us to identify g^jy with Thorne and Hartle’s “external” metric at least 
in the weak sense that g/^jy = Q^iy -h 0(e^, er^)\ since only first derivatives of 
appear in Eq. (82), we can make no claims on agreement at order or higher. 

I show in the next section that my definition of the regular field also satisfies the 
remaining nice property: it is causal on the worldline. I also note that at least through 
order r^, my definition of agrees with the Detweiler- Whiting definition [52], 
despite the fact that the Detweiler- Whiting field is defined in a wholly different 
manner based on Green’s functions [9, 23]. However, despite all these reassuring 
facts, I stress that in general, even if the effective metric I define is found to satisfy 
each of the three ''nice ” properties at a given order , it is not the unique field 
satisfying those properties. One can simply shift any given free field z,k^) 

(with n' < n and I sufficiently large for to not appear in the nth equation of 
motion), and its nonlinear combinations, from the regular field into the self-field. 
The strongest of my “nice conditions” appears to be that the object behaves as a 
test body in the effective metric: since higher multipole moments couple to higher 
derivatives of the effective metric, this strongly constrains which part of the full 
metric should go into the effective one. At any finite perturbative order, however, 
one can always alter the effective metric’s higher derivatives without spoiling any of 
its nice properties. Nonetheless, I deem my “nice” choice most natural as a part of 
the process of solving the relaxed Einstein equations using the local expansion (47): 
before making reference to global boundary conditions, I simply put all the terms 
that involve the object’s multipole moments into the self-field, and I put all the terms 
made up entirely of unknown functions into the effective field. 



3.6 Summary at First and Second Order 

To extract the key pragmatic information of the preceding sections, I restate the 
conclusions at first and second order: The first-order field is given by Eqs. (70) 
and (71), where the self-field is locally determined by the (constant) mass m, 
and the effective field h^l is to be determined by global boundary conditions. The 



^^The monopole correction (64) can also be trivially rewritten in terms of as 

5ma(3 = \rn + m(gap + 2uaUp)u^u^h^l + 4u(amh^^^u^, (83) 

where I have set = 0, and all fields are evaluated on 7. 
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second-order field is given by Eqs. (72) and (73), where the self-field is locally 
determined by (i) the mass m, (ii) the first-order effective field h^l, (iii) the (parallely 
propagated) spin 5^, and (iv) the monopole correction 6m given in Eq. (83); the 
effective field is to be determined by global boundary conditions and by h^l. I 
encourage readers to examine the more explicit expressions for these fields given in, 
e.g.. Ref. [52]. 

Einally, the object moves on a worldline governed by Eq. (82). 



4 Algorithm for an nth-Order Self-Consistent 
Approximation: Point Particles, Punctures, 
and Global Solutions 

With the general solution in the buffer region ready at hand, I now describe how 
to use it to generate a global solution. In short, this relies on a rigorous procedure 
of replacing the physical field in with a fictitious field without altering the field 
elsewhere. What are the fictitious fields? Simply the analytical expansion (59) from 
the buffer region, extended to apply to all r > 0. This was done already for h^^ in 
the previous section. Continuing the expression for into makes it into a field 
that diverges at r = 0: the true self-field in the interior of the body, whatever it may 
be, is replaced with this divergent field, and the self-field becomes the singular field. 
Since this extension into B^ does not affect the field in the buffer region, it also does 
not affect the field values outside the buffer, out in the external universe. 

Section 4.1 describes how this analytical extension can be used to ascribe a cer- 
tain pointlike stress-energy distribution to the object, thereby recovering the point 
particle picture at linear order (though this stress-energy I construct cannot be taken 
as a physical source in the Einstein equations beyond linear order). Section 4.2 then 
describes how the analytical extension can be used to obtain a global solution at any 
order; this will be the first point at which a global boundary condition (BCl) is finally 
imposed. 



4.1 Skeletal Stress-Energy 



Consider the solution (59) extended down to r >0. Eor reasons described in Sect. 1 .2, 
there is no known distributional source for this solution on a domain that includes 
r = 0. However, we can devise the following setup: each of the seed solutions can be 
thought of as being sourced by a pointlike stress-energy supported on 7. Everything 
else in the general solution then grows from these seeds, either being generated by 
nonlinearities or in the case of the free fields, being determined by global boundary 
conditions. 
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I refer to the stress-energy for the seed solutions as the skeletal stress energy, 
taking after a similar phrase in Ref. [35] ; the idea is that in some sense, the object (or 
body) can be replaced by a skeleton. That skeleton is made up of multipole moments 
living on the object’s worldline. 

For each seed solution z, /” ), I roughly follow the approach taken by 

Gralla and Wald at first order [18], defining the (trace-reversed) distributional stress- 
energy tensor 

^ (84) 

The right-hand side can be written as 



= d‘di (85) 

where = d’dt 

First examine the most singular term. Using the identities = (—1)^(2^ — 

1)!!^;^ 3ndd^dir~^ = — 47T(5^(x), we have 






47r(-l)^+>/;, 



diS (x). 



A+^ (21 - 1 )!! 

Integrating the right-hand side against a test function we find 

J pj'^‘'I^„LdLS^(x)^/^d^xdt = (- 1 )^ J dt dt 

= (-1/ [ 



( 86 ) 



(87) 

( 88 ) 



where in going from the second line to the third I have utilized the identity di = 
Here I have defined to be the tensor that agrees with if all 

at are spatial indices and zero otherwise, meaning that is sxF with respect 

to 9iiv and that = 0. Equation (88) shows that 

l^n)^dL5^{x) = f I^^-^°‘'"'°‘^5{x,z)-avaidT, (89) 

J-f 

Equation (85) now reads 

^ /(':7‘-“^(5(x, zy,ai-atdT + 



(90) 
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Now note that, by construction, N vanishes pointwise for r > 0. If it 
is nonvanishing as a distribution,^^ it must have support only on 7, in which case it 
must be proportional to or a derivative thereof; but from the calculation just 
performed, that would lead to a different algorithmic moment (with I' / 1) 
appearing in the solution, which would contradict the definition of the seed field 
So N vanishes as a distribution. 

Therefore, Eq. (84) becomes 



frix- Z, I'D = 5i)!! I (91) 

If we add up all the multipole moments, we arrive at a skeletal stress-energy tensor 
ft^u ^ ^ jn^ = X / z)-,av-a,dT, (92) 

where I have defined the corrected and normalized moments 4 ( 2 £-\)!! 

^ Notably, this skeletal stress-energy agrees in form with that of the tradi- 
tional multipolar expansion of a material body’s stress-energy tensor [35, 69, 70]. 
At first order in e, there is only one seed field, z,m), and Eq. (91) gives 

T^'^(x\z,m)= / mu^u^S(x, z)dr (93) 

J'l 

— precisely the point particle stress-energy used in Sect. 1.1. Because = h^l -h 
z, m), and = 0, we can conclude that = tI^^(x; z). In 

other words, the first-order field is identical to one sourced by a point particle. Of 
course, this is derived from the analytically extended field, but it also applies to the 
physical field at distances r ^ e from the worldline. 

At second order, there are seed fields, z, ^m), z, 5'), and 

(x; z, M); although the last of these three we set to zero with our choice of z^, it is 
worth displaying the skeletal stress-energy that would source it if we chose a different 
z^. Equation (91) gives 

T 2 ^ (x; z, Sm) = -^ j 6(x, z)dr, (94) 



^^That it must be well defined as a distribution follows from it being the result of linear operations 
on and itself being a sum of terms constructed from linear operations on an 

integrable function. The latter fact follows from the first term in the sum being expressible as the 
linear operation di on an integrable function proportional to [as in the text above Eq. (86)], and 
all higher-order terms in the sum being constructed from linear operations on lower order terms in 
the sum (as described in Sect. 3.2). 
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z,S) = - I z)dT, (95) 

J'j 

T^''{x\ z,M) = - j u^u’'M^VaS{x, z)dT, (96) 

where the overline indicates trace reversal. Appendix 2 shows that \ zo, 

[plus a piece of {x \ zo. dm)] is precisely equal to the linear term in the expansion 

of t!^^{x\ z, m) given z^ = Zq + jm + 0(e). 



4.2 Puncture Schemes 



I now describe how one can use the results from the buffer region to obtain a global 
solution. The method is called a puncture scheme. It has become standard in the 
linearized problem [54-57] , and it is the only known practical way to obtain numerical 
results at second order and higher [14, 15, 20, 52, 71]. 

A puncture scheme begins with the construction of a puncture defined by 
truncating the local expansion of the singular field from the buffer region at some 
specified order. One then defines the residual field 

( 97 ) 

and in a region covering the object, writes a field equation for rather than 
one for (the analytically continued) physical field h/^^. Since ~ so too 
/z^ ~ . The better represents , the better /z^ represents . For example, 

iflim;c^ 7 [/i^z/(-^)“^^z/(-^)] = 0,thenlim;,_>^/z^(v) = lim;,^^ /zj^^(v); thatis, the 
residual field agrees with the regular field on the worldline. If agrees with /z^^ to 
one order higher, meaning = o(r), then lim;c ->7 = hni;c ^7 

since the self-force is constructed from first derivatives of /zj^^, this condition guar- 
antees that the force can be calculated from /z^, as in Eq. (100) below. 

There are several schemes that can be developed from the starting point of the 
puncture. Here I describe a worldtube scheme in the tradition of Refs. [54, 56] . In this 
type of scheme one uses the field variables /z^'^ inside a worldtube T surrounding 
7, the field variables /z^^ outside that worldtube, and the change of variables /z"^ = 
/z^^ + when moving between the two regions.^^ Concretely, a second-order 
puncture scheme is then summarized by the coupled system of equations^^ 



^^One does not solve the problem in each domain separately, since the separate problems would 
be ill-posed. Instead, when calculating at a point just outside F that depends on points on past 
time slices inside F, one makes use of the values of already calculated at those earlier points, 
and vice versa; see Sect. VB of Ref. [54]. 

^^The effective sources are usually written to include the skeletal stress-energy terms, which 
are canceled by distributional content in Here I have instead followed Ref. [20] by 
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II 
1 
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inside F, 


(98a) 


II 

0 


outside F, 


(98b) 


= 25^R^,Ah\h^] - Ef,Ah^^] ^ 


inside F, 


(99a) 


E^Ah^] = 25^R^Ah\h^] 


outside F, 


(99b) 






(100) 



where the puncture diverges on the worldline determined by Eq. (100). Here I have 
looked ahead by using the second-order equation of motion (for an object whose spin 
and quadrupole moments vanish); the first-order equation (9) could be used instead, 
though as discussed in Sect. 1.5.4, the results would be accurate in a smaller region 
of spacetime. 

In this scheme, Eqs. (98)-(100) must be solved together, as a coupled system 
for the variables I (inside/outside E), and I The residual fields 

govern the position of the puncture, and the position of the puncture effectively 
sources the residual fields. This system of equations is to be solved subject to some 
global boundary conditions. Eor simplicity, we can consider using specified initial 
data on a Cauchy surface. 

I make two technical asides: according to Eq. 72, contains terms proportional 

to the first-order regular field h^l. So at each timestep in a numerical evolution, one 
must first calculate the first-order residual field from the first-order puncture, then 
use that residual field to calculate the second-order effective source, and so forth. 
Also, the divergence of the individual sources is quite strong at the worldline, with the 
terms 26^Ri^iy[h^, h^] and in Eq. (99) each going as by construction, 

these divergences analytically cancel each other, but the cancellation is numerically 
delicate. 

In principle, there is no obstacle to using a puncture scheme at any order in 
e. Outside F, one may solve the physical problem 

inside F one may solve the effective problem . . . , h^~^] — 

EnAh^^] = and when crossing F , one may change variables from the residual 
field to the full field via -h Of course, this requires sufficiently high- 

order expressions for the puncture and for the equation of motion governing how the 
puncture moves. But we know how to obtain both expressions from a local analysis 
in the buffer region. 



(Footnote 29 continued) 

writing the source pointwise, off 7; if the puncture agrees with the singular field sufficiently well, 
the source at points on 7 can be defined as the limit from off 7. 

^^Because the approximation is accurate only within a finite region of size 1 /e, one might better 
solve the equations in the future domain of dependence of a partial Cauchy surface. 
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Fig. 3 Replacement of the physical metric with an effective metric plus a puncture. In the top 
row, the physical metric is split into a self-field plus an “effectively external” metric 
9 HU = 9i^u + h^u - bottom row, we “black out” the region in and very near the object, and we 
replace the physical metric with a singular field (any local approximation to which is called a 
puncture) plus the “effectively external” metric. The replacement of the self-field with the singular 
field is made only very near the object: the self-field and singular field are identical to one another 
in the buffer region and beyond; the effective metric is completely unaltered everywhere it is 
defined 



The basic picture of a puncture scheme is illustrated in Fig. 3. Physically, the 
scheme replaces the actual problem in the region covering the body, with all its 
matter fields, singularities (in the case of a black hole), or other oddities (in the 
case of exotic matter), with an effective problem. But it yields the correct physical 
field outside the object. Hence, while we must abandon the point particle model at 
nonlinear orders, we can replace it with the more general concept of a puncture, a 
local singularity that encodes all the necessary information about the object in its 
multipole moments. 



4.3 The Causality of the Regular Field 

I now return to the question of how “nice” the regular field is, by which I specifically 
mean the three enumerated properties in Sect. 3.5. I have already shown that at all 
orders, the regular field I defined is a vacuum solution. I have also shown that at first 
order, the object moves as a test body in the effective geometry it induces, and (for 
spherical, nonspinning objects) this result will be extended to second order in Sect. 7. 
Now the final property follows immediately, at all orders in e, from the design of the 
puncture scheme: the residual field and its derivatives manifestly depend only on the 
causal past, and by construction, the residual field and any number of its derivatives 
agree with those of the regular field on the worldline. Taking the limit of an infinite 
number of derivatives, we get the desired result. 
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4.4 A Note on ^^regularization^^ 

In the gravitational self-force literature, one often speaks of “regularizing” the field 
or the self-force. This can mistakenly give the impression that one has introduced 
infinities into the problem, and that one must regularize them to recover the physical 
result. But in the formalism I have described here, one only ever deals with finite 
quantities. The force on the object, for example, is derived from the field equations 
outside the object, and it is written in terms of manifestly finite fields in that region. 
Those fields outside the object are then written in terms of quantities on the worldline 
by identifying S\t), a field off the worldline, with e^^S^{z(t)), for example; or 
similarly, by defining the regular field such that (z (0) = {t) and ■ (z (0) = 

From this perspective, the various “regularization” methods that have been used 
in the self-force problem to remove the ‘singular part’ of the field [8, 9] arise only as a 
practical necessity: we cannot easily determine the physical metric inside the object, 
nor are we interested in doing so, which prompts us to replace it with the fiction 
of a singular field solely as a means of calculating the physical metric outside the 
object. Computational techniques such as puncture schemes and mode- sum “regu- 
larization” [8, 9] are not methods of removing singularities; they are simply methods 
of calculating the particular finite quantities in question. In mode-sum regulariza- 
tion, for example, one rigorously writes a spherical-harmonic mode decomposition 
of h^liz) by decomposing h^l{z) = - h^l), with being the field 

of a point mass. Every quantity in the calculation is finite every step of the way. 



5 Gralla-Wald and Osculating-Geodesics Approximations 

Given results in the self-consistent approximation, one can always obtain analogous 
results in a Gralla-Wald or osculating-geodesics approximation by performing an 
expansion of the worldline, as described in Sects. 1.5.5 and 1.5.6. In this section I 
present that procedure and those results. 



5.1 Gralla-Wald Approximation 

There are two ways to obtain the Gralla-Wald approximation from prior results: by 
expanding the worldline in the self-consistent results; or more simply, by returning 
to the results in the buffer region and simply choosing the worldline at r = 0 to be 
70, the zeroth-order worldline. 
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For the moment, let us take the second route. All the local results are unchanged, 
except that is set to zero and the mass dipole moment is not set to zero. This 

means that the metric perturbations are modified from Eqs. (70)- 

(73) to be 



hf, = zo, m) = + O(r^), (101) 

^ 0 ’ ( 102 ) 

f>0 



and 

hf. = 20 , M) + zo, 5) + zo, 5m) 

+ hfj^{x-z^,m,{k\h>^,). (103) 

20, {d)^>o), (104) 

£>0 

where 

2Mn^ 

h^^fix- zo, M) = V + 0{\/r), (105) 

and Sm^jy differs from Eq. (83) by the inclusion of the M/ term in Eq. (64), becoming 

5ma/3 = \m +m{ga/3 + 2uaup)u^u''h^l 

+ 4u(a(mh^^^u'^ + 2Mg)). (106) 

Otherwise, all functionals remain completely unchanged, except that they are eval- 
uated as functionals of Zq rather than of z^. The regular field naturally remains 
a vacuum solution even on 70, its first two orders satisfying = 0, 

Ef,u[h^i] = 25^R^,[h^\h^^l and L^[h^>^] = 0. 

Finally, if we define Zij_ = / m to be the deviation perpendicular to 70 (defining 

=0), then from the master equation (65), we get the Gralla-Wald equation 



( 107 ) 
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Just as in the self-consistent case, these local results can be used in the design of 
a puncture scheme to obtain global results. However, before describing that scheme, 
I return to the other method of obtaining the Gralla-Wald approximation: by substi- 
tuting an expansion of the worldline into the self-consistent results. 



5.1.1 Expansion of the Worldline 

We would like to express the expansion covariantly, in terms of vectors that live on 

70. Suppose we begin with an expansion z^(5', e) = Zq(s) -h €Zi(s) -h e^zf^is) H , 

where Zn(s) = ^ The linear term, Zi(s), is automatically a vector on 70, 

since it is a first derivative along the curve of constant 5' and increasing e. However, 
beyond linear order, the terms are no longer tensorial; each of them is simply a 
collections of four scalars dependent upon the particular chart in which z^ (^ , e) = 

v^(7(^, e)). 

So let us approach the problem more geometrically. To facilitate the expansion, I 
introduce a Lie derivative £ that acts on a functional’s dependence on z^' 

£^Aa(x-, z) = -^Aa(x\ z + XO\x=o^ (108) 

where A is a multi-index of any covariant and contravariant rank. This is closely 
related to a Lie derivative acting C moves the field point relative to the world- 
line; £ moves the worldline relative to the field point. The two operations are not 
identical, since the tensorial character of the functional is different at the two points. 
For example, in Eq. (217) we see that \ z) is a rank-two tensor at x^ but a scalar 
at z^. 

An expansion of a functional of the worldline z^(s, e) in the limit e ^ 0 is really 
an expansion along the flow of increasing e. We can write this as 

Aa(x; Ze) = Aa(x; zo) + e6AA{x; zo) + €^S^Aa(x; zo) H , (109) 

where 

6^Aa(x;zo) = -£^,Aa(x;zo); (HO) 

n\ 



here ^ is the generator of the flow. 

Now, let Z2/7 = This is a vector on 70. It is equal to Z2 if the 

expansion of the worldline is performed in Fermi normal coordinates. One can obtain 
a covariant evolution equation for it from a second-order self-consistent equation of 
motion, as described in Appendix 1. Beginning from Eq. (191), this procedure leads 
to Eq. (205), for example. More importantly for the present analysis, we can also write 
the expansion (1 10) in terms of this quantity. As an instance of that, the second-order 
term is 
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^4Aa(x; zo) = Z2f^h’Aj^{x\ zo) + ^4 zf V^/V,,/Aa(x; zo), (HI) 

where primed indices refer to the point Zq . One can do the same at any order: from 
the equation of motion for obtain evolution equations for vectors on Zq , and write 
the expansion (109) in terms of those vectors. 

Applying this expansion to the perturbations from the self-consistent approxima- 
tion, we have 

€"h"^{x- z) = 2o) + €'’~^'£^^h"^(x; zo) + C>(e"+^). (112) 

Therefore, the Gralla-Wald expansion (31) reads 

hfiiy = eP^^(x; zo) + zo, Zi) + O(e^), (113) 

with \ zo) being the same functional as z), but with Zq having replaced 

z^ in its argument, and with the second-order perturbation given by the new functional 

p^^{x\ zo, zi) = hf^^(x; zo) + Sh^^ix; zo, zi). (114) 

where <5/i"^(x; zo, zi) = iz^hn^ix- zo). 

The same expansions are applied in precise analogy for the singular and regular 
fields, yielding with 

=hl{^\x;zo), (115) 

zo) + Sh^JJ^\x-, zo, zi). (116) 

Appendix 2 describes these expansions in more detail. But in a few words, the 
end result is that the procedure exactly recovers Eqs. (101)-(107). All of the meat of 
the result is in zo, zi): the action of £ on precisely generates the mass 

dipole moment seed field zo, M) and the contribution of the mass dipole 

moment to Appendix 2 also shows that (i) the generation and modification of 
these two seed fields precisely corresponds to the linear term in the expansion of the 
skeletal stress-energy z) around z^ = Zq, and (ii) the term (5/z^^(v; zo, zi) 

[and analogously, 6h^{^^(x; zo, zi)] can be written as a functional 6hj^j^(x; zo, zi±) 
that depends only on the perpendicular piece of z^. Result (ii) is in agreement with 
the fact that the seed fields depend only on = mz^^. Physically, this results from 
the field equations’ indifference to the piece of the deviation that is tangential to the 
worldline, which can always be set to zero by reparametrizing the family z^(s, e) 
with a parameter = 5’'(5’, e) that ensures curves of fixed s' strike Zq orthogonally 
at e = 0. 
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5.1.2 Global Metric 

A global solution can be obtained from the local results just as in the self- 
consistent case. I first note that the local singularity structure of the singular field 
-h is identical to the self-consistent singular field, but for two 

important alterations: 

• The divergent terms diverge on 70, not on 7. 

• The second-order singular field depends on the correction to the position. 

Because the point at which the puncture diverges is independent of the perturba- 
tions in this expansion, the puncture scheme becomes a sequence of equations, 
rather than a coupled system: first, the zeroth-order worldline is prescribed as a 
solution to the background geodesic equation, 

^=0, (117) 

dT^ 

then the first order field is found from 

= inside To, (118a) 

Efiu{h^}=^ outside To, (118b) 

then that field is used to find the first-order correction to the position by solving the 
Gralla-Wald equation (107), and finally the second-order field is found from 

E^Ah^^\ = 25^R,,Ah\h^]-E^Ah^^\ inside Tq, (119a) 

E^Ah^] = 25^R^Ah'^,h^] outside To- (119b) 

Here Tq is a tube around 70; unlike in the self-consistent case, neither 70 nor Tq 
need be updated over the course of the numerical simulation. Like the self-consistent 
puncture scheme, these equations can be solved given initial data on a Cauchy sur- 
face.^ ^ 

Again, there is no obstacle to carrying a puncture scheme like this to arbitrary 
order, given the local solution in the buffer region. 



5.2 Osculating-Geodesics Approximation 

I refer the reader back to Sect. 1.5.6 for a reminder of how the osculating-geodesics 
approximation works. In brief, it obtains a self-consistent approximation by first 



Since the approximation is held to be valid in a region of size <^(e) 1 / ^, a reasonable approach 

would be to solve the equations in the causal future of a partial Cauchy surface of that size. 
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using a sequence of Gralla-Wald approximations to find the self-consistent worldline 

then solving the relaxed field equations with already determined. Concretely, 
through second order one seeks a solution to the self-consistent system (98)-(100) 
by applying the Gralla-Wald approximation (1 13) to the right-hand side of Eq. (100) 
at each instant r, with the expansion e) = 

around the geodesic that is instantaneously tangential to z^ at time r. The 
terms h^^{z{r) \ zo, • • • , Zn-\) that then appear in Eq. (100) are found by solv- 
ing Eqs. (117)-(119). By using the resulting sequence of forces eF^(r\ zo(r)) + 
e^F2 (t; zo(r)^ zi{t)), one can solve Eq. (100). Einally, one can solve Eqs. (98) and 
(99) using the z^ one has already found. 

What is the advantage of this over simply solving Eqs. (98)-(100) directly? At 
linear order, the advantage is that one can solve Eq. (118) very easily in the frequency 
domain, while the self-consistent equations do not seem to admit a frequency decom- 
position. One can then compute a table of values of fI^ (r; zo(r)) for different geo- 
desics and easily solve = eF^ (r; zo(r)) to obtain an approximation to z^ [60]. 
But at second-order, it is not obvious whether the scheme is simpler than a direct 
solution of the self-consistent equations. The term zo(r). ^i(r)) that appears 

in the second-order field (and a piece of which appears in the puncture grows 
with time away from the osculation point = z^{r)\ can it be handled in an efficient 
way? Eurthermore, one must ensure that appropriate boundary conditions are used in 
each Gralla-Wald expansion to reproduce desired global boundary conditions (such 
as no incoming radiation) on the self-consistent field; since the approximations only 
apply in a region of size ^ 1 / ^J~e, is there any simple way to find these boundary 
conditions? I leave these questions open. 



6 Gauge Transformations in Perturbative 
Descriptions of Motion 

Thus far, all the explicit results I have presented have been confined to a single choice 
of gauge. But as I described in Sect. 1.4, gauge and motion are intimately related in 
perturbation theory: any worldline z^ one finds can always be shifted by an amount 
Precisely how this impacts one’s approximation scheme depends strongly on 
how one represents the worldline. 

The effect of a gauge transformation on the worldline (or equivalently, on the 
self-force) was first explored by Barack and Ori [50]. Their results were extended to 
discontinous gauge transformations by Gralla and Wald [18, 19]^^ and to even more 
singular transformations in Ref. [3 1] . In this section I focus, for simplicity, on smooth 
transformations, and I aim mostly at (i) clarifying the gauge freedom in each of the 



^^The extension of Gralla and Wald’s result to the self-consistent case [12] unfortunately contained 
a significant error, leading to a result that held only in gauges continuously related to Lorenz, as in 
the Barack-Ori analysis. 
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different representations of motion and their corresponding approximation schemes, 
and (ii) defining appropriate rules for the gauge transformations of the singular and 
regular fields. Reference [18] contains a more comprehensive treatment. 



6.1 Review of Gauge Freedom in Perturbation Theory 

Before considering the question of gauge in self-force theory, I briefly remind the 
reader of the basics, following Refs. [72-74]. It will be convenient in this section to 
adopt index-free notation for tensors. In perturbation theory, we consider a family 
of metrics e), or simply g in the absence of a chart. This family lives on a 

family of manifolds Me, and a given choice of gauge refers to an identification map 
: Mq Me- The identification map induces a flow through the family down to 
the base manifold Mo where the background metric g = go lives. Call the generator 
d d)^ 

of this flow X = We wish to approximate a tensor A at a point (pfip) G Me 
as an expansion around its value at e = 0. This expansion is given by 

= y ^(£^A)(;,), (120) 

^ n\ 

n>0 



where * denotes the pullback of , £ is the Lie derivative, and p e Mo- We 
define the ^th-order perturbation in this gauge to be 

A^ip) ^ 2(£« A)(p). (121) 

n\ 

Now say we work in a different gauge. This corresponds to a different choice of 

identification map ^ Afe and flow generator F = . The approximation 

of the tensor A in terms of tensors at the point p G Af o is now given by 

{<t>l*A){p) = (e^^^A)(£), (122) 

and the ^th-order perturbation is 

Al{p)^-(ClA){p). (123) 

n\ 

The gauge transformation of A is the difference between the two expansions 
when evaluated at a point in Afo- in more common notation, we say ^ = 

An -h AA^, where the primed tensor refers to the Y gauge, the unprimed to the X 
gauge, and their difference is 

AAnip) = 2(£^A)(£) - -(C\A){p). 

m n\ 



( 124 ) 
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The first- and second-order terms are easily expressed in the familiar form 



AAi=/:^iAo, (125a) 

AA 2 = (125b) 

where = Y — X and ^2 = 1^] are the usual gauge vectors. Higher-order 

terms are straightforwardly written down, but since concrete self-force results are 
not available beyond second order, I stop here. 

In a chart, one can show that a gauge transformation can equivalently be thought 
of as a near-identity coordinate transformation rather than a change in identification 
map. First lay a chart on each Me using some identification, say x^(^f (p)) = 
x^(p) for each p e Mo- The two identification maps (j)f and identify the point 
p in Mo with two different points q = (p) and q' = (pl {p) m Me, which are 

related by the active diffeomorphism q \-^ ((pf)~^(q)) and which have slightly 

different coordinate values. Since the chart consists of four ordinary scalar fields, 
we can apply the general expansion (122) to write the coordinates at as 



xi^(q') = x>^{q) + ci'^ixiq)) + 



£,'^{x{q)) + -i\{x{q))d^£,>^{x{q)) 



+ O(e^), 

(126) 

where I have used Cxx^ = 0 to express Y derivatives as ^ derivatives, and I have 
used the fact that x^(p) = x^(^) to express the components on the right-hand 
side as functions of x^(^). Now, rather than an active diffeomorphism on Me, let 
us consider this as a passive change in the chart, x^{q) 1 -^ x'^(x(^)). Define the 
coordinate transformation such that x'^{q') = x^{q). Rewriting Eq. (126) as an 
equation for x^(^) as a function of x^(^0. we get 



x'l^iq') = x>^{q') - e^'^ixiq')) 
+ O(e^). 



(.'iixiq')) - ■^£.Wq'))d^eiix{q')) 



(127) 



Gauge transformation laws for components of tensors can be derived directly from 
this coordinate transformation. For example, by rewriting Eq. (127) as an equation for 
x^{x\p')) and substituting it into the ordinary transformation law for the components 
of the metric, one finds 

dx ^ dx ^ 

g;4^'-e) = ^^ga; 5 (Vx'),e) (128a) 

= e) + e) + e) + O(e^), (128b) 
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where the analogous transformation law for a tensor of 

arbitrary rank is also easily found. To relate this to the language used above, note 
that we are now using a single identification map 0e, and given that identification, 
components of g have identical coordinate values as (0e*g)- Equation (128) applies 
even if e) is not expanded for small e. If it is so expanded, then Eq. (128) 

returns Eq. (125). 

Of course, we are ultimately interested in solving the Einstein equation, and so in 
addition to transformations of the metric, we must consider transformations of curva- 
ture tensors. Useful identities for the transformations of curvature tensors are derived 
in Appendix 3. One of their consequences is that when examining perturbations of 
a curvature tensor, one can derive transformation laws in two equally natural ways: 
directly from Eq. (125) or from the transformations of the metric perturbations. Eor 
example, in a vacuum background, Eq. (125) directly implies 

ASR^Ah^] = C^^Rf,Ag] = 0, (129) 

A(5R^Ah^] + S^R^Ah\ h^]) = C^SR^Ah^]; (130) 

or the same equations can be found by instead using Eq. (125) for the metric itself, 
writing 



A{SR^Ah^]+S^Ri,Ah\h^]) 

= 5R^Ah'^]+5^R^,Ah'\h'^] - {SR^Ah^] + S^R^Ah\h^]) 

= 5R^Ar.^,g] + ^^SRf^uWlg] + 5Rf,AC^y] 

+ 25^Rf,Ah\ y^g] + 5^Rf,Ay^g, y^g] (131) 

and then applying Eqs. (231)-(233). 



6.2 Gauge in the Gralla-Wald Approximation 



I reverse my usual ordering by first considering transformations in the Gralla-Wald 
approximation, which allows the most straightforward treatment. Since the Gralla- 
Wald expansion is an ordinary one, with coefficients independent of e, all the ordinary 
rules apply. 



6.2.1 Transformation of the Metric and the Worldline 

Eirst, let us examine the transformation of the deviation terms in the expansion of 
the worldline. According to Eq. (127), under a gauge transformation the coordinates 
zf^(s, e) = on the worldline become 
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e) = Z>^{s, e) - 



^2^z) - 



+ O(e^), (132) 



where functions of are evaluated at z^(s). By expanding this in powers of e, we 
immediately find 

z;T( 5) = z^(5), (133a) 

z'i''(5) = z^(5) - et(zo), (133b) 

Z 2 (s) = Z 2 (s) - ^ 2 (^ 0 ) + ]j^i\{Zd)dui^{Z(i) - z\{s)d,y^'^{ZQ), (133c) 

where functions of Zq are evaluated at Zq(s). Note that the zeroth-order worldline is 
unchanged; the effect of the transformation is to alter the deviations relative to that 
worldline. 

Now let us examine the transformation of the metric perturbations. Applying the 
transformation laws (125) to the metric, we find with 

zo) = jC,^i9nu, (134a) 

zo) = C^ 29 fiv + zo). (134b) 

Corresponding to the treatment of the worldline, all like all diverge at Zq (s). 
Instead of altering the curve on which the fields diverge, the gauge transformation 
alters the singularity on that curve, by altering the functions z^^q that appear in 
In fact, one need not even appeal to Eq. (132) to determine how the deviation 
vectors are transformed: the transformation laws for z^^q, as given in Eq. (133), can 
be derived directly from those for Eor example, appears in the metric as the 
mass dipole moment jm, and the transformation of can be determined from 
the transformation law for By applying Eq. (134b) to the results (101)-(104) 
in Eermi normal coordinates^^ and comparing to Eq. (105), one quickly finds that 
^6 contributes a mass dipole term. More specifically, 

contributes the relevant term, given by ^^2 This modifies by 

an amount AM^ = from which we read off the same result as in Eq. (133b). 



6.2.2 Transformation of the Singular and Regular Fields 

Though deriving the transformation laws for was trivial, we must put some 
thought into how to do the same for the singular and regular fields and We 



^^At this stage the transformation is applied only to the sums one does not yet 

need transformation laws for the individual pieces h^’l and 
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are free to split into a singular piece and a regular piece however we like, just 
as we were with However, ideally, we can do so in such a way that we preserve 
the nice properties of the split. Allow me to specialize to a nonspinning, spherical 
object, such that there is no force from coupling of moments to external curvature. 
This means the properties I wish to preserve are that the transformed worldline 
[in its expanded form (133)] should satisfy the geodesic equation in the transformed 
effective metric and that effective metric should satisfy the vacuum 

Einstein equation. 

Appropriate transformation laws can be found by noting that for a smooth metric 
and smooth worldline, both the geodesic equation and the vacuum Einstein equation 
are manifestly invariant under a generic smooth coordinate transformation. It follows 
that if the worldline and the metric in those two equations are expanded in a power 
series, and the metric and the expanded geodesic transform according to the standard 
laws of gauge transformations, then the perturbative expansions of the worldline and 
metric continue to satisfy the perturbative geodesic equation and the perturbative 
Einstein equation. In our case, we already have the worldline transforming according 
to the standard transformation law. Accordingly, if we define the regular field to 
transform like an ordinary smooth perturbation, as where 



zo) = (135a) 

zo) = zo), (135b) 

then all the nice properties of the regular field are maintained under a gauge trans- 
formation. However, at the same time, the total field -h must satisfy 

Eq. (134). This means the singular field must satisfy 

= hll + - A/$R«) (136) 

— or more explicitly, 

A^SJ,(x; zo) = 0, (137a) 

= (137b) 



6.2.3 Governing Equations in Alternative Gauges 

By design, the transformation laws (133)-(137) ensure that the governing equations 
of the Gralla-Wald representation are invariant under a gauge transformation: in any 
gauge, the regular field satisfies the vacuum equations 5Rnjj{h^^] = 0, 5Rnjj{h^] = 
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Zq is unaltered and the deviations from it satisfy (35) and (36)^"^; 
and assuming the gauge choice admits a well-posed initial value problem, the full 
field off the worldline can be calculated using the puncture scheme encapsulated in 
Eqs. (117)-(119). These facts follow from the invariance of the geodesic equation 
and Eqs. (129)-(131). However, because of Eq. (137b), the singularity structure of 
the second-order field, and hence the puncture in the puncture scheme, is altered by 
a gauge transformation. We can assess the impact of this change by examining how 
it manifests in the second-order field equation. Using Eq. (233), we have 



- 2S^R^Ah^\ C^g] (139a) 

= 87r£5,7;V - 2S^Ri,Ah^\ (139b) 

The second term on the right is precisely the term required to make a solution 

to the correct equation, SRjj^Ah'^^] = —5^Rf^p[h'^^, — 2S^ R/j,Ah'^^ , at 

points off 70 . The first term on the right-hand side of Eq. (139b) indicates a change in 
the skeletal stress-energy: ^ This change in skeleton corresponds 

to a change in the seed solutions zo, ^^^) and zo, M). An explicit 

expression for can be derived from one for , given in Eq. (215). Erom 

that latter equation, we see that both alters 5m and more notably, shifts 

the mass dipole moment by an amount AM^ = — in agreement with the 
discussion above. 

Of course, to make use of the fact that the governing equations are the same 
in all smoothly related gauges, one must first find an effective metric satisfying 
them in a particular gauge. And given that multiple effective metrics can satisfy 
the same governing equations, one must realize that under a gauge transformation, 
one is referring to the transformation of one’s particular choice of effective metric. 
Eurthermore, there is the potentially more dangerous caveat that the entire analysis of 
this section assumes smooth transformations. Under a transformation that is singular 
on the worldline, the singularity structure of the field can be significantly altered, the 
transformation laws for the deviation vectors are no longer given by Eq. (133), and 
the relation between the “nice” effective metrics in the two gauges will not be given 
by Eq. (135). 



^^Rather than saying the equation of motion (35) is invariant, self-force literature usually talks 
about a transformation of the self-force. At first order, the equation in the new gauge is + 

dTQ 




where the second equality follows from h'^J = h^l + With in the 

left-hand side of the equation of motion, the equation’s invariance is transparent. 
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6.3 Gauge in the Self-Consistent Approximation 

I now consider the freedom in the self-consistent expansion, which is slightly thornier 
than the Gralla-Wald case. 



6.3.1 Transformation of the Metric and the Worldline 

First, I note that the self-consistently determined worldline transforms according 
to Eq. (132). Unlike in the Gralla-Wald case, neither nor z'^ are expanded around 
Zq . 

Now to the metric. Let us adopt the passive view. Suppose we performed the 
self-consistent expansion in two slightly different coordinate systems and 
the unprimed coordinates, the expansion is performed by writing g^^ as e; z) 

and expanding for small e while holding x^ and z^ fixed. In the primed coordinates, 
we write as g'^j^(x' , e; z') and expand while holding x'^ and z'^ fixed. In terms 



of the perturbations = g^^ — g^jy and = g'^^ — g^jy, we have expansions 

e; z) = z), (140) 

e; z') = ^ z'). (141) 

The metric in the two coordinate systems are related, as in Eq. (128), according to 

Ox^ Ox^ 

e; z') = ^-^^apixix'), e; z{z')), (142a) 

= e; ziz')) + £^g^^(x', e; z{z')) 

+ e; z{z')) + 0(e^). (142b) 

Expanding g^^ix' , e; z{z')) returns 

e) = z) + £5ifl/ji/(x')] + e^[/j^j,(x'; z) + C^^g^tvix') 

+ C^,hl^{x'- z) + \c\g^Ax')] + O(e^). (143) 



In Eq. (143) I have simply left z^{z') as z^. Just as in the Gralla-Wald case, 
the term C^^hj^j^(x'; z) introduces a mass dipole moment relative to the reference 
worldline — in this case, relative to z^. But if we now expand z^(zO around z^ = z'^, 
the perturbation reads 



h'nuix', e) = e[h^^^{x'\ z') + Cf^ig^Ax')] + e^[/j^^(x'; z') + C^^diii'ix') 

+ + £^i)h^nu(x' z') + p^,/(xO] + O(e^). (144) 
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where z') = z' , 0 uses the Lie derivative introduced in Sect. 5. 

The two Lie derivatives z') do not precisely cancel one another, 

but does precisely remove the mass dipole moment induced by ; simply put, 
moves everything relative to the worldline, and £^^ moves the worldline by the 
same amount, such that there is no net change relative to the worldline. In other 
words, as we expect, contains no mass dipole term when written as a functional 
of the transformed worldline z'^. The fact that the dipole moment cancels in this way 
can be seen more explicitly by acting with + £^^) on the leading term in 

~ rT%T’ il ^Iso be inferred from the result (216). 

Equation (144) is an expansion at fixed x'^ and Hence, we can read off the 
coefficients of and say that the individual terms in the two expansions (140) and 
(141) are related as 

z') = z') + (145) 

h'l,(x'-, z') = h%(x'-, z') + + £^,)hl,(x'-, z') 

+ £'^29fj,uix') + (146) 

We can see from this analysis that a gauge transformation acts quite differently 
here than in the Gralla-Wald case. In the Gralla-Wald approximation, the zeroth-order 
worldline, on which the singular field diverges, is invariant; in the self-consistent 
approximation, a gauge transformation actually shifts the curve on which the singular 
field diverges. 



6.3.2 Transformation of the Singular and Regular Fields 

As in the Gralla-Wald case, I define the split into singular and regular fields in the 
primed gauge in the simplest way that preserves the properties of the split. In other 
words, we must have the effective metric transform as any ordinary smooth 

tensor field would: = §^§^ga/ 3 ix(x')), which implies g'^^Jx', e; z') = 

^ ^Qaisixix'), e; z{z')), and from there, 

h'llix'; z') = hllix'- z') + £5,5^^(x'), (147) 

h'^^ix'- z') = hllix'- z') + (£e, + £i,)hll{x'- z') 

+ £-^2giJ't^ix') + -C^^gij,u{x'). (148) 

At the same time we must satisfy Eqs. (145) and (146), which leaves the singular 
field to transform as 
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hfjx'; z') = h%ix'; z'), (149) 

z') = h^^^ix'; z') + (£■^^ + z'). (150) 

With these choices, the worldline z'^ is a geodesic of and is a smooth 
solution to the vacuum Einstein equation. 



6.3.3 Governing Equations in Alternative Gauges 

Again as in the Gralla-Wald case, by design, the transformation laws (147)-(150) 
ensure that the governing equations of the self-consistent approximation are invariant 
under a gauge transformation: in any gauge X, the regular field satisfies ] = 

0, = —5^R^^{h ^^ , h^^], where SRJ^^ is the linearized Ricci tensor in the 

X gauge; the center-of-mass worldline on which the field diverges satisfies (28); 
and assuming well-posedness, the full field off the worldline can be computed using 
the puncture scheme (98)-(100), with the replacement —2dR^^. The only 

change is to the form of the second-order puncture. The nature of this change can be 
inferred from the analogue of Eq. (139b), which reads 

= 87r(£^, + f^,)7;V “ 25^Ri,Ah^\ (151) 

This is just as in the Gralla-Wald case but for the presence of the £ term. The two 
Lie derivatives induce a change in the skeletal stress-energy, and hence in the seed 
solutions contained in the puncture, but unlike in the Gralla-Wald case, this change 
does not alter the mass dipole moment; the presence of ensures the mass dipole 
moment remains zero, as discussed above. So \ z, M) vanishes in all smoothly 

related gauges, and only h^^^^{x \ z, 5m) is altered. 



6.4 Gauge in the Osculating-Geodesics Approximation 

Gauge freedom in the osculating-geodesics approximation is more complicated than 
in either of the previous two cases. Here, there is freedom at two levels: in the calcu- 
lations in the Gralla-Wald expansion at each osculation instant, and in the calculation 
of the self-consistent perturbation h^iy(x; z) that stitches together the Gralla-Wald 
expansions. 

If we always use the same gauge at the two levels, then there is no real complica- 
tion: a gauge transformation at the self-consistent level can be expanded out to find 
the induced gauge transformation at the Gralla-Wald level in a straightforward way. 
However, in principle, calculations at the two levels can be performed in different 
gauges. In that case, one must have good control over the relationships between the 
gauges, since the final self-consistent approximation /z^zy(v; z) = 
satisfies the Einstein equation through order only if z^ is in the same gauge as 
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Suppose we wish to calculate z) in a gauge X (denoted by unprimed 

symbols), but at an osculation instant r we wish to calculate the Gralla-Wald pertur- 
bations zo(r)) in a gauge Y (denoted by primed symbols). To find the curve 

in gauge X, we need to solve 



dr^ 



z) = eF^ir; zo(r)) + e^F^ir; zo(r)) 



(152) 



with the forces in gauge X. But as input we have only the F-gauge perturbations 
\ zo(r))- For simplicity, consider only the first-order force in gauge X. It is 
related to F-gauge quantities as [see Eq. (139)] 



^f(r) = 



fZ{t) + 



( 






l± 



dri 






> \ 

0(r) y 



(153) 



where F[^{t) = - zo(r)) - zo(t))]u‘^uI. To know the cor- 

rect force for our self-consistent evolution, we need to know the gauge vector 
However, this gauge vector is constrained by the osculation condition: we must have 
z^{t) = Zq^^^(t) and u^{t) = Hence, we should impose (z(r)) = 0, 

reducing Eq. (153) to 



= f [^ -h 






1± 



drl 



(154) 



Using this relation, one can calculate the force that appears in Eq. ( 152) using the force 
calculated from the Gralla-Wald expansion in gauge F — but one must know 

cItq 

at the osculation instant r. The same analysis can be straightforwardly generalized 
to the second-order force F^ (r; zo(r))- 



7 Equations of Motion from a Rest Gauge 



In the algorithm of Sect. 3, the ^th-order equation of motion is obtained by solving 
the (n + l)th-order Einstein equation in the buffer region. However, at least in certain 
cases, we can instead obtain an equation of motion using only the nth-order solution. 
This is accomplished by finding the relationship between one’s desired gauge — call 
it the practical gauge — and what I call a rest gauge, which is, essentially, any of 
the gauges that have been used to study tidally perturbed objects in the context of 
matched expansions (see Ref. [75] and the many references therein for examples 
of these studies). Methods along these lines were the first ones used to derive the 
MiSaTaQuWa equation [21] and currently, they are the only ones that have been 
used to derive second-order equations of motion [14, 16, 20, 76], although different 
notations and descriptions have obscured the shared underlying features of these 
derivations. 
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Throughout this section, I specialize to the case of an approximately spherical, 
nonspinning object whose leading-order dipole and quadrupole moments vanish: 
l} = =0.1 only sketch the calculations in this section refer the reader to 

jjjiyi Ujiyi j ^ 

Refs. [16, 20] for the details of the calculations in this section. 



7,1 Metric in a Rest Gauge 

By a rest gauge, I mean a gauge in which the object is manifestly at rest relative to 
a worldline and manifestly centered on that worldline. To motivate this idea, I 
return to the themes of the introduction. In Sect. 1.1.2, 1 argued that any equation of 
motion can be written as the geodesic equation in some smooth piece of the metric. 
In Sect. 1.3, I recalled the results of Thorne and Hartle, who showed something 
slightly stronger: that the equations of motion and precession for any compact object 
can be written as those for a test body immersed in some effectively external metric. 
In Eq. (82), the self-force program has recovered their result for the center-of-mass 
motion, and it has shown that the “external” metric in their formalism is in fact (at 
least through order er) the effective metric = gf^jy + 

At second order in e and beyond, the algorithm of Sect. 3 has not yet revealed 
how g/^j^ relates to Thome and Hartle’s external metric. But if we can establish the 
relationship, we can establish the equation of motion. Notably, they derived their 
equations by working in an inertial frame of their external metric; if the object 
possesses no spin or quadrupole moments, then there is no force on the object in this 
frame, and it is what I call the rest gauge. Why I call it a gauge rather than a frame 
will become clear as we move forward. 

The metric in this gauge is most easily constructed through the inner expan- 
sion (39) rather than the outer. To do so, I first note that just as the matching con- 
dition fixes a lowest allowed power of r in the outer expansion, so it determines a 
highest power of r in the inner expansion: expanding the ^th-order inner perturbation 
x^) in the buffer region, we find the condition that 

no negative powers of e occur in the outer expansion then determines 



Here and in what follows, I assume some locally Cartesian coordinates centered 
on the center-of-mass worldline Given an appropriate choice of rest gauge, the 
coordinates will later be identified with Fermi- Walker coordinates centered on z^. 

For our approximately spherical, approximately nonspinning object, the inner 
background metric, when expanded in the buffer region according to Eq. (42), is 
equal to the Schwarzschild metric through order has the schematic form 




(155) 



p<n 



9 



obj 




(156) 
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here I do not specify any particular coordinate system, though I insist that the coordi- 
nates are mass-centered, such that no mass dipole moment appears in the expansion 
of every term in Eq. (156), at the orders displayed, is fully determined by m. 
Using our previous results, we know m is constant. Hence, we can find the perturba- 
tions in the buffer region using the well-developed formalism of perturbations 
of Schwarzschild. Furthermore, given the assumption that the perturbations are qua- 
sistationary (because, as explained in Sect. 2, the inner expansion scales space but 
not time), time derivatives appear as higher-order terms in the Einstein equations. 
That is, one first solves the time-independent linearized Einstein equation 

= 0 (157) 

for with t fixed; here H^] = 0 means the Ricci tensor linearized off 

the background g^j, and the superscript “0” means “terms in 6R^j^ containing no t 
derivatives”. Next, one solves 



H^] = -S'^R-l^g^^K h\ - 5Rl^{g^'°\ (158) 

for at fixed t and for the time derivative of where the superscript “1” in 
means “terms in SR^jj containing one t derivative”. At this point, the pattern 
should be obvious. 

Just as when constructing the outer expansion in the buffer region, we here solve 
a sequence of spatial differential equations. Furthermore, it is well known that in 
an appropriate gauge, stationary vacuum perturbations of Schwarzschild behave as 
1 /^f-f 1 Qj^ yt large f , where i is the spherical harmonic index, just as in the outer 

expansion. Let us label the l/r^+^ solutions as x^) and the solutions 

as where tends to a finite function {t)h^ as r ^ oo. 

Combining this with Eq. (155), we find that at first order, where there is no source, 
the solution behaves as 

//* ~ + 1//*’°’“ + 2//1.1,- 0{\/r^). (159) 

r 

It is well known that the monopole and dipole solutions and are 

pure gauge. Hence, I set them to zero. The gauge-invariant content in is a 

correction to m, and in it is a perturbation toward the Kerr metric — that is, 

a small spin. With a bit of work, one can show from Eq. (158) that these perturbations 
are independent of t , and we can freely set them to zero, absorbing the mass correction 
into m and specifying that the object remains unspinning at this order. Physically, 
this time-independence corresponds to the fact that physical changes in mass and 
angular momentum are caused by tidal heating and torquing, which are caused by 
nonlinear effects at much higher order in e. So in the end, we have 



464 



A. Pound 



Hi, = 0. (160) 

Since Hi, vanishes, Eq. (158) is again the linearized vacuum equation: 

, //^] = 0. The solution behaves as 

h'^ ~ + 0(llP^). (161) 

r 

Again, and can be gauged away. The invariant part of is 

another correction to m, which again can be absorbed into a redefinition of m . Writing 
we then have 

Hi, ~ P^Hhl+{t, r)n‘j + 0{l/r^). (162) 

2 2 + — 

lim^^oo ^^lij be written in terms of two STF functions Sij (t) and Btj (t), 

and in the expansion of (A r) for large r, every term is directly proportional 

to one of those two functions (i.e., constructed from contractions of one of them with 
dij or eijk). These two quantities can be interpreted as quadrupole tidal moments 
in the neighbourhood of the object, and from the matching condition they will be 
identified with those of the external background metric, defined in Eq. (46). 

The third-order Einstein equation now reads 

H^] = -SRI,[3°^K h\ (163) 



Following the same steps as at the previous orders, we find 



H, 



fiv 






+ 



?3 h2,2,+, 
Hmj 



r-Ht:tp;^{l, r)fi‘J 



+ r)n‘j + 0{l/r). 



(164) 



3 3 + - 

r) can be written in terms of two STF functions £ijk(t) and Bijk(t), such 
that every term in its large-r expansion is directly proportional to one or the other. 
These two quantities can be interpreted as octupole tidal moments in the neighbour- 
hood of the object, and they will be identified with those of the external background 
metric, which are constructed from covariant derivatives of the Riemann tensor of 

Equation (164) additionally contains corrections to the quadrupole fields, both 

* * *22 + 

in the form of the time derivatives and Bij appearing in , and in the field 

32 + 22 + 

, which like can be written in terms of two STF functions, call them 

UL^ij 3.nQ OO/y • 

Now let us put the results together. After we rewrite it in terms of unsealed 
coordinates r = er and re-expand in e, the metric g^^ = + e^Hl,{r) + 

<^^Hl,{r) + reads = gf,,{r) + eh'l,{r) + e^hl,{r) + e^h'l,(r) + 

with 
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^ y + mrSij + r^SEij + mr'^{£ijk + £ij) + 0{P), 

^ ^ + mr6£ij + m^r{£ijk + £ij) + O(r^), 

^ -\-^^^£ij + 1^^(£ijk +^0‘) -\-0(r), 

where primes refer to quantities in the rest gauge, and 7^ = diag(— 1, 1, 1, 1). ‘+’ signs 
here mean “plus terms directly proportional to”. For compactness, I have omitted 
the magnetic-type moments Bij, Bij, 6Bij, and Bijk\ they appear in exact analogy 
with the electric type moments £ij, £ij, 6£ij, and £ijk. All terms in the tableau, at 
the displayed orders, are directly proportional to one of these moments and/or to a 
power of m. Every term in the first column is £ = 0, every term in the second is 
1 = 2, and every term in the third is either 1 = 2 (the £ij and Bij terms) ovl = 2 (the 
£ijk and Bijk terms). In this gauge, the object is manifestly at rest on there are 
no mass dipole moment terms, which have the even-parity dipolar behavior /r^, 
nor any acceleration terms, which have the even-parity dipolar behavior rn\ 

By choosing the gauge of the inner expansion appropriately, we can put the exter- 
nal background metric given by the top row of Eq. (165), in precisely the 
form (45) that it takes in Fermi- Walker coordinates — except that no acceleration 
terms appear. Since we know that the center-of-mass worldline is accelerated in 
this tells us that Eq. (165) is actually incomplete. The construction we have followed 
here is valid for any worldline z^, regardless of whether it is accelerating in g^jy or 
not. The fact that we see no explicit acceleration terms is equivalent to the empty 
statement that any worldline z^ can be written as a geodesic in some smooth piece of 
the full metric. Ergo, at this stage we know nothing at all about z^, and to make g^^^ 
the external background spacetime, we must insert into it by hand the acceleration 
terms appearing in Eq. (45). At the same time, in our choice of gauge, no accelera- 
tion terms appear in the full metric g^^. Therefore, we must insert terms into the 
perturbations h'Jl^ such that the sum g^jijj(r) -\- e/z^^^(r) -h e^h'^j^{r) -h e^h'^^{r) -\- 0{e^) 
contains no acceleration terms when one applies the expansion (25). 

The form of g^^ in the rest gauge invites us to make a new split into an effec- 
tively external metric = g^jj l^at contains the m -independent tidal terms 

in Eq. (165), plus a self-field = g^^ — every term of which is directly 
proportional to a power of m. With an appropriate choice of rest gauge, looks 
exactly like Eq. (45) with none of the acceleration terms; that is, our gauge makes the 
Fermi coordinates refer to proper distances, times, Riemann curvature, and parallel 
transport defined with respect to We have 
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077 = -1 - -h 0(r^), 

Qta — “ 3^0/07-^ “h 0(r ), 

Qab — ^ab y^^aibj^ -\~ CT{r ), 



(166a) 

(166b) 

(166c) 
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where = Riiavp + + O(e^) is the Riemann tensor of As 

per the discussion of acceleration terms in the preceding paragraph, the new “regular 
field” reads 



[)f/ = 2F(ni - SFoiOj[fj'^]x‘xJ + 0(r\ (167a) 

= -ISRoiajlb’^WxJ + O(r^), (167b) 

= -^SRa/bj[l)^]x‘xj + O(r^), (167c) 

1)^2 = 2Flm + 0(r2), [)R2 ^ f^R2 ^ (igg) 

and = 0{r). For visual clarity, I have truncated these equations at lower orders 
in r than Eq. (165). 

In summary, in a rest gauge the metric perturbations take the form 

= C + (169) 



where is made up entirely of terms containing explicit factors of no mass 
monopole corrections, no spin moment, no quadrupole moment, no mass dipole 
moment, and no acceleration terms, and is given by Eq. (167)-(168). Equa- 
tion (169) is the general solution in the buffer region outside an approximately 
spherical and nonspinning object, just written in a particular gauge. In this gauge, 
is manifestly a geodesic of an effectively external metric, and is being defined as 
that effectively external metric in which is a geodesic. A priori, need not be 
related to the latter was defined in a very different manner and is not yet known 
to be the metric in which the motion is geodesic (except at zeroth and first order). 



7.2 Self-Consistent Approximation: Worldline-Preserving 
Gauge Transformations 

Metrics in local rest gauges are common in the literature, usually derived in the 
context of tidally perturbed compact objects. The goal of the self-force game is not 
simply to construct such a metric, but to find the equation of motion in whichever 
“practical gauge” one wants to use to compute a global solution. 



^^Note that without some additional input beyond that definition, the split of into and 

is quite ambiguous. Suppose there were a zeroth-order force acting on the object. One could 
still write the equation of motion as a geodesic in some smooth piece of the metric, but to do so, 
one would have to shift part of into ; one would not simply be splitting the perturbations 
With the present setup, the ambiguity is lifted by assuming the expansion (25) and utilizing 
the independently determined fact that Fq = 0. 
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Let us take the Lorenz gauge as the practical gauge. I wish to find a unique gauge 
transformation between the rest-gauge perturbations (169) and the Lorenz-gauge 
perturbations (70)-(73). That is, I seek gauge vectors and ^2 satisfying 

= (170) 

~ ~ k^\i9nv — (171) 

Note that here I do not utilize the third-order perturbation, though its form was 
found in the rest-frame gauge. What is its relevance? Only that it contains no mass 
dipole moment term; in other words, it shows that with this choice of gauge, the 
correction to the mass dipole moment can unambiguously be set to zero to define a 
center-of-mass-worldline. 

These equations are to be solved subject to one crucial condition: they must pre- 
serve the location of the worldline. The rest-gauge solution shows that the object is at 
rest on some worldline, the Lorenz-gauge solution is written in coordinates centered 
on some worldline, and in order to claim that the worldline of the Lorenz-gauge 
solution is the desired center-of-mass worldline, we must enforce that it be identical 
to the worldline of the rest-gauge solution. In the simplest scenario, Eqs. (170) and 
(171) can be solved with a smooth gauge transformation of the form 

= ( 172 ) 

p>0l<p 



Given Eq. (132), the condition that the worldline not be altered is that 



Coo(0 = Cl^ = 0. (173) 

I call a transformation satisfying this condition a worldline -preserving transforma- 
tion. This condition was first used in Ref. [21]. A more general condition is used in 
Refs. [14, 16]. 

By working through sequential orders of r in Eqs. (170) and (171), one uniquely 
determines {t) (up to residual gauge freedom in the rest gauge and Lorenz gauge). 
I do not present details of that process here, but the explicit solution to the first- 
order equation (170), with particular choices of rest gauge, can be found in various 
references; see, e.g.. Ref. [12]. The explicit solution at second order is presented in 
Ref. [16]. What is the essential result in these solutions? At first order, they uniquely 
determine , which appears via Eq. (167), to be the MiSaTaQuWa force (66) (with 
=0), as we already know. At second-order, they uniquely determine the second- 
order force F 2 , which appears via Eq. (168), to be 



Ff 







R1 
tt • 



(174) 
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Following Appendix 1, the results for and can be combined to write 
the equation of motion = eF^ + e^F^ + 0(e^) as the geodesic equation 

= 0{e^)mg^jj = This is the result promised way back in Sect. 1.5.1. 

What is the meaning of this result? In the rest gauge, we knew that was a 
geodesic in Now we know it is a geodesic in Either of or g^jy can be 
thought of as a “nice” effectively external metric. Are they the same metric? Not 
necessarily. The two geodesic equations only restrict the metrics and g^^jy on the 
worldline and their first derivatives on the worldline; they place no constraint on 
higher derivatives. But one of the results of the process of finding and ^2 is that 
the two metrics are related by 

C = + + (175) 

+ 0{r'^). (176) 

In other words, they are equivalent up to possible 0{r^) differences. Are they the 
same at order and higher? The answer depends on precisely how the rest gauge, 
and the tidal moments d£ij , 6Bij (and at higher order in r , octupolar and higher order 
moments) appearing in is constructed; this construction is, unfortunately, not 
unique. Reference [16] demonstrates explicitly that the two effective fields can differ 
at order r^. 

Is there a way to determine which effective field is the “best one”? As described 
in Sect. 3.5, the most promising route is by solving the field equations outside an 
object with spin and higher moments, since those moments will couple to some tidal 
moments, and we can determine which metric the tidal moments belong to, and 
thence the metric in which the object moves as a test body. But just with the results at 
hand, we can confidently say that g^jy has all the nice properties one might desire of it 
for a spherical, nonspinning object: it is a vacuum solution, causal on the worldline, 
and the object moves on a geodesic of its geometry. 



7.3 Gralla-Wald Approximation 

Starting from the self-consistent results, one can readily derive the second-order 
Gralla-Wald equation of motion (36) following the expansion procedure of Appendix 
1 . 

Alternatively, one can derive a second-order equation of motion directly, as was 
done by Gralla [20] . In the Gralla-Wald case, the rest gauge is constructed in coor- 
dinates centered not on but on Zq. The results are the same as in Sect. 7.1, but all 
acceleration terms are set to zero in g^iy (and likewise for the F^ terms in h'^^ that 
cancel them). When transforming to a “practical gauge”, one uses 
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^1,1^ = + \^\i9nu + 

and one does not impose the worldline-preserving condition. The gauge transforma- 
tion is allowed to move the object relative to Zq. Since zf = z '2 =0 in rest 
gauge, we have from Eq. (133) that in the practical gauge 

4(^) = -Cf(zo), (179) 

Z2(s) = -^ 2 (^ 0 ) + ^^i(zo)d„^^{zo), (180) 

and the covariant second deviation is Z 2 f(s) = —^ 2 (^ 0 ) + (zo)'^u^i(zo). 

Following Gralla, one can use this procedure to obtain results in any gauge 
smoothly related to a particular rest gauge. Defining regular fields 

= (181) 

(182) 

one finds evolution equations for Zn in terms of by taking appropriate derivatives 
of these relations; any desired gauge condition is imposed on is computed 

via a puncture scheme, and Zn is computed from Because of the particular 
method of construction, in which the rest gauge is defined with respect to Zq rather 
than z^, the evolution equations for Zn do not correspond to an expansion of the 
geodesic equation in the metric = g^jj thnt Gralla defines; this lack of 

geodesic motion in should amount to a slightly different definition of 

than the one used in the self-consistent expansion, although no one has yet carried out 
a detailed comparison of Gralla’ s second-order results to those of the self-consistent 
method. 



8 Conclusion 
8.1 Summary 

In this review, we began with the idea of a point particle interacting with its own 
backscattered field; we ended with the idea of using laws of gauge transformation 
to obtain laws of motion for extended objects. And yet, reassuringly, from the latter 
description we have recovered the former. 

Along the circuitous way, I have emphasized several core ideas. First, the laws of 
motion of an extended object, to all orders in perturbation theory, are determinable 
from the metric outside the object; all information about the object’s shape and 
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internal composition is encoded in a discrete set of multipole moments, themselves 
defined from the form of the metric in the buffer region outside the object. To obtain 
an equation of motion of order in perturbation theory, moments of multipole 
order £ = ^ are required. Second, the laws of motion found in this way show that at 
least through second order in perturbation theory, a small, sufficiently spherical, suffi- 
ciently slowly spinning compact object moves on a geodesic of an effectively external 
metric that (i) satisfies the vacuum Einstein equations and (ii) behaves causally on 
the object’s representative worldline. Furthermore, the analysis of the metric outside 
the object provides a clean way of separating the metric into two constituents: a 
suitable effectively external metric satisfying the above properties, and a self-field 
that, loosely speaking, locally characterizes the object. 

This treatment weds several traditions that run through the history of the problem 
of motion in general relativity, from the characterization of motion of a material body 
in terms of suitably defined multipole moments in the tradition of Mathisson [35] 
and Dixon [34], to the method of algorithmically characterizing the metric in terms 
of multipole moments in the multipolar post-Minkowskian theory of Blanchet and 
Damour [5, 66], to the derivation of laws of motion of asymptotically small objects 
from the Einstein equation outside the objects in the tradition of Einstein [49], 
D’Eath [43], and Thorne and Hartle [46]. The perturbative treatment here also com- 
plements the non-perturbative treatment of Harte [36]. Just as Harte brought Dixon’s 
project to fruition by finding a utile separation of the metric into a self-field and an 
effectively external metric, so the same can be said about the method here bringing 
to fruition the project of Thorne and Hartle. 

An important aspect of any perturbative treatment of motion is the relationship 
between that motion and gauge freedom. It is well known that on short timescales, 
the self-force is pure gauge; it can be freely set to zero by a suitable choice of gauge. 
Equivalently, one can say that on short timescales, the perturbed worldline of the 
object can be transformed to a geodesic of the background spacetime. 

What physical role, then, does the self-force play? On short timescales, one can say 
the following: the deviation from geodesic motion appears explicitly in the second- 
order field. Hence, in order to get gauge-invariant information about the second-order 
field from the solution to the Einstein equation in any given gauge, one must include 
the effect of the self-force. (Note that the argument doesn’t run in reverse. Since 
the first-order deviation is determined entirely by the first-order gauge choice, one 
does not need the entire second order perturbation to obtain invariant information 
involving the first-order deviation [77].) However, on these short timescales, the 
perturbative correction to the motion need not be accounted for to obtain physical 
information from the first-order metric. 

As I have argued, this situation changes when one considers long scales. If one 
seeks a solution valid on a large domain, such as over the course of a binary inspiral, 
then one must account for the correction to the motion in order to obtain a well- 
defined perturbation theory. If one were to attempt to put the perturbative shift in 
the motion into the second-order field, then on this timescale the second-order field 
would grow larger than the first-order field. Hence, one must incorporate the effect 
on the motion into the first-order perturbation. In this sense, the perturbed worldline 
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itself is quasi-invariant under a gauge transformation on long timescales; it may only 
be shifted by a small amount of order e, while the shift due to the self-force is a much 
larger effect, of order (or depending on whether one is looking at orbital 
phase or orbital radius in an inspiral, for example). 

For this same reason, if one is interested in long timescales, one need not include 
the second-order perturbation in order to obtain gauge-invariant information about 
an inspiral. For any given short patch of that inspiral, the effects of the self-force will 
be pure gauge; but the accumulation of those effects over long timescales is invariant 
within the class of gauges that are well behaved on these timescales. 



8.2 Future Directions 

Although the foundations seem in place and the second-order results seem sufficient 
for the practical purposes we can presently imagine, several open avenues remain to 
be explored. 

First, at present we have obtained second-order equations only for objects whose 
spin and quadrupole moments vanish at leading order. It would be worthwhile, 
and astrophysically relevant, to obtain second-order equations of motion for more 
generic compact objects. As is well known [34], the object’s leading order quadrupole 
moment would generate a force by coupling to the external curvature. Additionally, 
there may be an effect due the correction 6S^ to the object’s spin. 

The mention of this subleading spin raises another question to consider: what is 
the physical content of the perturbed multipole moments? I have discussed above 
how the monopole correction 8m is pure gauge (at least on short timescales). 
Will higher corrections, or corrections to higher moments, contain more physical 
information than this? We can expect that they will: tidal heating and torquing, for 
example, should create physical corrections to the mass and spin [78, 79], and at 
least for a material body, the external tidal fields should correct the higher moments 
by physically deforming the object [80-82]. Ultimately, to model the motion of a 
particular class of objects, such as realistic neutron stars, one will need to explicitly 
match an inner expansion to the outer expansion I have discussed; the matching 
procedure will uniquely identify the multipole moments of the objects as they appear 
in the metric in the buffer region. To this end, one might consider metrics of tidally 
perturbed neutron stars, such as those described in Refs. [80, 81]. 

One might also try to describe less compact astrophysical objects, or more exotic 
objects, by broadening the scope of the perturbative expansion. Rather than assuming 
the object is compact, such that its linear dimension d is of the same order as its mass 
m, one might consider a two-parameter family corresponding to m and d. This would 
alter the orders at which various multipole moments appear in the buffer region, since 
they scale as md^ . One could even try to describe approximately string-like objects 
by examining a limit in which two of the object’s linear dimensions go to zero while 
one linear dimension remains finite. 
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To gain further insight into the physical content of the multipole moments, one 
could also relate them to the moments appearing in other formalisms, such as the 
non-perturbative formalism of Harte [36]. Another point of comparison with Harte 
would be the definition of the effectively external (i.e., the “regular”) field. As I have 
discussed, the choice of effectively external field is far from unique, and in fact an 
infinite number of choices could be made that would still guarantee the center-of- 
mass motion is geodesic in the effective metric. Here, I have taken as my guideline 
that the effective metric should be a vacuum solution that is causal on the worldline 
and in which the motion is geodesic; these conditions still do not uniquely identify an 
effective field, but they narrow the range of options, and I have shown how a choice 
satisfying these conditions arises naturally in the process of solving the Einstein 
equation in the buffer region outside the object. However, Harte makes a different 
choice, which does not satisfy the vacuum Einstein equation. How are the two related? 
The answer is not obvious. One way of establishing a stronger relationship would be 
to find perturbative equations of motion for objects with higher multipole moments: 
because moments of higher I couple to higher derivatives of the metric, they feel 
much more of the field than does a monopole. With Harte ’s definition of effective 
field, the equation of motion is precisely that of a test body in the effective metric. 
Would the same be true in the effective metric I have defined here? 

In addition to comparison with Harte, the results I have described should be related 
to Gralla’s more closely related, perturbative results [20]. At second order, Gralla 
makes a choice of regular field that is a vacuum solution (at least to the order in r to 
which he defines it), but in which the center-of-mass motion is not geodesic. 

Although I have emphasized the idea of splitting the physical metric into a self- 
field and an “effectively external” metric, and although I have linked that split to 
a generalized equivalence principle, the array of choices of regular field should 
make clear that there is a danger of over-interpreting the physical meaning of any 
particular choice, no matter how nice its properties. This risk is also present because 
in general, a regular field satisfying nice properties on the object’s worldline is 
acausal when evaluated away from the worldline. As a practical matter, the freedom to 
define different effective metrics may make future comparisons of self-force results to 
post-Newtonian results more hairy; given the vast freedom — even while maintaining 
the property that the motion is geodesic in the effective metric, for example — it is 
remarkable and fortuitous that agreement has so far been found for so many different 
effects that appear to rely on particular choices of this field [83-85]. 
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Appendix 1: Expansions of the Geodesic Equation 
Expansion in Powers of a Metric Perturbation 

In this appendix, I examine the expansion of the geodesic equation in any sufficiently 
smooth metric the treatment is generic, not specialized to a spacetime containing 

a small object. I expand only in powers of a sufficiently smooth metric perturbation; 
I do not expand the worldline itself. Hence, the analysis is meant to apply to the 
self-consistent representation of motion, not to the Gralla-Wald representation. 

The geodesic equation reads 

^ (183) 

where s is a. potentially non-affine parameter on the curve z^(s), ^ is its 

tangent vector field, is the Christoffel symbol corresponding to g^^, and k = 

If we now write the metric as the sum of two pieces, g^^ = g^jy + and if we 
take s = T, the proper time on z^ in and if we rewrite the geodesic equation in 
terms of covariant derivatives compatible with g^jy, we find 

(3^ = -AT^^yuP + (184) 



where and 



{2hs(p-,,) - dss) 

is the difference between the Christoffel symbol associated with the full metric g^^ 
and that associated with the background metric g^jy . With r as a parameter, k becomes 



- hn^ui^u'' 

yi - 



(186) 



So far no approximation has been made; Eq. (184) is exact. If we now expand 
and k in powers of h^jy, we find 



a 



a 



= ~ h°"^){2h6(j3-,7) ~ hi3yy)u^u^ - ihi3^.yu°‘u^u"/u^ 

— ■;^h^phf}-yyu°‘u^u^u^u^u'' — hfj^u^a^u^ + O(h^). 



( 187 ) 
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This equation is complicated by the fact that the acceleration appears on both sides 
in a nontrivial way. To disentangle the acceleration from the perturbation, I assume 
that a^, too, has an expansion in powers of 



- Kn + «quad + 



(188) 



where is linear in hf^iy and is quadratic in it. Substituting this expansion 
into Eq. (187), one finds 



<uad = n{2hs(p-,^) - 



(189) 

(190) 



where Summing these, we have 



cIt^ 



= - /*^)(2/*5(ft^) - + 0{h?). (191) 



As applied to the case of the effective metric g^jy = g^jy + (i.e., replacing h^jy 

with Eq. (191) agrees with the second-order self-forced equation of motion 
derived in the body of the paper. 



Expansion in Powers of a Metric Perturbation 
and a Worldline Deviation 

In the last section, I expanded the geodesic equation while holding the solution 
Ze (s) to that equation fixed. I now expand Ze (s) as well. This procedure yields a 
sequence of equations for the terms in the expansion of Ze(s), suitable for a Gralla- 
Wald approximation. My approach to the expansion closely follows the treatment of 
geodesic deviation in Sect. 1.10 in Ref. [86] 

I first describe the geometry of the situation. Consider a family of world- 
lines z^(r, e), with each member Ze (r) = z^(r, e) governed by the equation of 
motion (184). Each member satisfies 

--^ = F^(r,e), (192) 



where r is proper time on Ze, and is given by the right-hand side of Eq. (184). 
The family generates a two dimensional surface S with a tangent bundle spanned by 
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^ and An important relation between these vector fields can be 

found from which implies CuV^ = 0 = and from there, 

(193) 

Now, we seek to describe the deviation of an accelerated worldline Ze (.t) from 
the zeroth order, geodesic worldline Zq (t) = z^(r,0). The first step is to expand the 
worldline in the power series 

z^(r, e) = Zq{t) + ezi(r) + e^z^(r) + O(e^), (194) 



where 



1 d^z^^ 

= (195) 

n\ oe^ 

We may also write the expansion as z^(r, e) = X h.^^^v^^\zoir)' Note that here z^ 
is a scalar field equal to the /rth coordinate field on the surface S. The leading-order 
term is the family member Zq (t) = z^(r , 0). The second term is (r) = C^z^lzo ~ 
(zo (^)) , a vector on Zq • But at second order and beyond, a subtlety arises : unlike the 
first derivative along a curve, which is a tangent vector, second and higher derivatives 
are not immediately vectorial quantities. The function z^(r, e) describes a curve in a 
particular set of coordinates, and the corrections Zn depend on the coordinate system 
in which one defines z^(r, e). Since my notion of an object’s center is established 
with reference to a comoving normal coordinate system, I wish my covariant measure 
of the second-order deviation to agree, component by component, with 5 ^^('r, 0 ) 
when evaluated in a normal coordinate system centered on Zq; Sect. 5 describes 
the utility of this choice when re-expanding a self-consistent approximation into 
Gralla-Wald or osculating-geodesics form. With that in mind, I define the vector 

1 Dv^ 1 

= -v^V;3v“ (196) 

2 de 2 

and I seek an evolution equation for its restriction to Zq, 

Z^^(T)^W^Ur)- (197) 

Z 2 f is the second-order term in the expansion (194) when that expansion is performed 
in Fermi normal coordinates centered on Zq • 

In addition to the choice of coordinates, the expansion (194) depends on the par- 
ticular choice of parametrization (r , e) of the surface S. A change of parametrization 
alters the direction of expansion away from Zq • Here, the parametrization is chosen 
such that r is proper time along each curve Ze (^), and a flow line generated by 
links points on different curves Ze (^) at the same value of r. When restricted to Zq , 
the parameter r is tq, the proper time on Zq . 
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With all preliminaries established, I now proceed to find the evolution equations 
for Zq, Zi, and Z 2 • The leading term clearly satisfies 



dT^ 



0) = 0. 



(198) 



For the others, I first find evolution equations for and and then evaluate the 
results on Zq . At first order, using Eqs. (192) and (193), we have 



dr^ 



II 

s 

—3' 


(199) 


\ / ;7 






(200) 


V ’ / ;7 






(201) 



where the second line follows from Eq. (193) and the third line follows from the 
Ricci identity and Eq. (192). Evaluating on Zq , I write this as 

^ = Ff (ro) - R'^ (202) 
dr^ 



where This is a generalization from the usual geodesic deviation 

equation to the deviation between neighbouring accelerating worldlines; it is valid 
even if F^(r, 0) / 0. 

At second order, repeated use of Eq. (193) and Ricci’s identity leads to 



dr^ 






(203) 



J;5 



^ +2 









) 



. (204) 



Evaluating on Zq and using Eq. (192), we can write this as 



dTn 



= F^iTo) - R°‘,iI3v (mo22f“o + 



+ '^R°‘ ^.Pv-^z!f z^^ 



(205) 



where Equation (205) describes the second devi- 

ation between neighbouring accelerating worldlines. In the case of neighbouring 
geodesics, it agrees with “Bazanski’s equation” in the form given in Eq. (5.9) of 
Ref. [87]. 
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V ij. 

The quantities appearing in Eqs. (202) and (205) can be straightforwardly 
evaluated by performing the expansion e) = e^^^(x) + e^/z^^(x) + 0(e^) in 

Eq. (191) and then taking covariant derivatives with respect to v^. The results are 



and 




(2^L;A - ^^A;.) ~ {2K.;XS 

- - Kx;,) 

+ Po'^hlf - Kxm) 



(206) 



^aX-u6) ^O^O^l 



(207) 



As applied to the case of the effective metric = g^jy F (i.e., replacing 

with ^^^), Eqs. (202) and (205), together with Eqs. (206) and (207), are the 
second-order expansion of the motion that apply in a Gralla-Wald approximation. 



Appendix 2: Expansion of Point-Particle Fields in Powers 
of a Worldline Deviation 

In this appendix, I derive the linear terms in expansions of the point particle stress- 
energy {x\ z) and the Lorenz-gauge retarded field z) when the worldline 

is expanded as z^{s, e) = Zq (s) -h ezi (s) -h O(e^), where 5' is an arbitrary parameter. 
I also establish the identification of these linear terms with the mass dipole moment 
terms found from the local analysis in Sect. 3. 



Stress-Energy 

I write the stress-energy in the parametrization-invariant form [9] 

T^^(x; z) = m ( g'^,{x, z)g^^,{x, z)F 5 {x, z)—j= =, (208) 

where (v ; z) is a parallel propagator from the source point v' = z(^, e) to the field 
point V, and z^ = 



478 



A. Pound 



Substituting the expansion (30) into this stress-energy tensor, we obtain 



z)=m f 5",(x, zo)g'^f{x, zoYzq 

J7O L -■ 



r(5(x, zo) + ezfViy'S(x, z)|e=ol 


J Jo<5'zJ Vyz^' 


L J 


Zq^Ok' 



ds 



+ 0(e"). 



•P • 

(209) 



In each instance, the evaluation at e = 0 occurs after taking the derivative. 

I simplify this expression using the distributional identities V 5 {x, z) = 
z) and z) = 0 [9]. I also use the identity 



4'(V/-'^“')L=o = 



ds 



1 

— ’ 



( 210 ) 



which follows in the same manner as Eq. (193). 
The result of those simplifications is 



eT^^{x; z) = eT^^^ix; zo) + e^ST^^^ix; zo, Zi) + O(e^) 



pa/3 



aj3 ^ 



(211) 



with 



Ty^{x\ zo)=m f S(x, zo)dTQ, (212) 

-'70 

5T°^{x-, zo, zi) = m f g2,g^,\h.u^p wf ^ - mo'mo Moyw^ ) <5(x, zo) 

-Mq'm^Zi 5yV^(5(x,zo)]<i?To, (213) 

where u^iro) = , and the parallel propagators are evaluated at (v , zo (^o^) • I have 

simplified these expressions by reparametrizing Zq in terms of ro, the proper time on 
70 , but note that this does not correspond to choosing the original parameter s = tq. 
Equations (211)-(213) are valid for any choice of parameter i*, and z^(ro) actually 
depends on the original choice of 5': a change of parametrization 5' ^ s\s,e) will 

change the direction of Zp in particular changing whether or not z^ is orthogonal to 

11 

To eliminate this dependence on the initial choice of parametrization, I rewrite 
in terms of the orthogonal part of Zp z^^ = + UqUqjj)z\. The result is 

= m [ llPp uf]_5{x, zo) ~ ^i±ff^Xjd(x, zo)l dr^, (214) 

J 7 O L -■ 
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where Note that the part of parallel to Uq does not appear in this 

expression. Again, this result does not depend on the initial choice of parametrization. 
One need not choose a parametrization by hand that enforces = 0; no matter 

the choice, only the perpendicular part plays a role in the field equations. 

The quantity STI^^(x; zq) is equal to £yTf^^(x; zo), where intro- 

duced in Appendix 1, and £y is the Lie derivative introduced in Sect. 5. In fact, for 
any vector z) is given by Eq. (214) with the replacement z^ ^ 

and Uq u^. This quantity is useful when considering gauge transformations in 
the self-consistent approximation. Also useful is the ordinary Lie derivative of ; 
taking similar steps as above, one finds 



z) = -m f 

J'y 



2u 



{a 



+ 



7 



dr' 
dr\ 






(215) 



where = P ^' and . The sum of the two Lie derivatives yields 

the simple result 

z) = -m j 7 z)dr' . (216) 

A z) term signals that the mass m is displaced from z^ by an amount 

the lack of any V^6(x, z) term in Eq. (216) signals that the displacements due to the 
two derivatives cancel one another, leaving the mass m moving on z^. 



Metric Perturbation 



In the Lorenz gauge, the first-order self-consistent field, given some global boundary 
conditions, is given by 



/z^^(x;z) = 4m j u""' dr' , (217) 

where is the Green’s function that comports with the global boundary con- 

ditions. I wish to expand this about z^ = Zq to obtain something of the form 

e/j|,^(x: z) = eh^^^{x\ zq) + Zo, zi) + O(e^). 



(218) 
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There are two methods available for achieving this: directly, following steps analo- 
gous to those in the previous section; or by making use of the result of the previous 
section. 

Here I adopt the second method. Noting that zo, ^l) = £vhj^u(^’^ that 

and that £y commutes with derivatives acting at v, we have 



Ef,^[£vh^](x; zo) = £vEfj,u[h^](x; zo) = -l6Tr£vT^j,{x; zo)- (219) 



Hence, 



Shj^jyix-, zo, zi) = £vhlj,{x; zo) = 4 j {x'\ zo, z\)dV' . (220) 

From Eq. (214), this evaluates to 



5h\^i,{x-, zo, zi) =4m j (2G^,,^vMo^ u\l + “o 4 _ l ) ^"^o- (221) 

-'70' ' 



Gauge Condition 

It is worth examining how z) and Shj^^(x; zo, zi) contribute to the Lorenz 

gauge condition. Those contributions are easily found by invoking the identity 
= —Gfiiii'-y) [9], where G^i^f is a Green’s function for the vector wave 
equation = 5^, and both Green’s functions must satisfy the same boundary 
conditions. Performing a trace-reversal on Eq. (217), taking the divergence of the 
result, using the Green ’s-function identity, and integrating by parts yields 

z)=4j Gaa'^(mu^')dT. (222) 

(Earlier results in this section assumed constant m, but here I momentarily leave 
it arbitrary for generality.) The contribution to the gauge condition is determined 
entirely by ^ and the acceleration of z^. In a Gralla-Wald expansion, one has 

zo) = 0, from which one can read off ^ = 0 and = 0. In a self- 

consistent expansion, one instead has V^[e/z^^(v; z) + e^hl^ix; z)] = 0{e^) [or 
more precisely, Eq. (26)], which determines Eq. (9). 

Doing the same with Eq. (221) yields 



ZO, Zl) 



= 4mj Gaa'^ 



dr'^ 



+ Zi_L^ 



) 



dTn 



(223) 
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The contribution to the gauge condition is determined entirely by the acceleration 
of the deviation from Zq (together with the geodesic-deviation term). In a Gralla- 

Wald expansion, zo, zi) is included in zq), and zq) = 0 

determines Eq. (35). 



Local Expansion and Identification of Mass Dipole Moment 

In Sect. 4.1, 1 showed that the mass dipole moment of the object creates a term (96) 
[and contributes to the term (94)] in the object’s skeletal stress-energy. By compar- 
ing that result to Eq. (214), we can make the identification = mz\, exactly as 
concluded elsewhere in the paper. Since the two stress-energy tensors are the same, 
it follows that \ zo, zi) includes the entire contribution to coming from 

the mass dipole moment. 

Here, I complete the circle by performing a local expansion of Eq. (221) near Zq 
and showing 5h^^^{x \ zo, zi) reproduces the order- 1/r^ and 1/r terms for the mass 
dipole seed solution in Eermi normal coordinates. The method of local expansion is 
elaborated in Ref. [9]. In particular, I follow steps analogous to those in Sect. 23.2 
of that reference. To avoid belabouring the details, here I provide only the barest 
sketch; I leave it to the interested reader to fill in the gaps using the tools of Ref. [9]. 

The starting point is the Hadamard decomposition of the retarded Green’s func- 
tion, = Uniyn'jy'6^(a)-\-Vniyn'iy'0^(-a). Hcrc 6^(a) is aDirac delta function 

supported on the past light cone of v, and 6>+(—cr) is a Heaviside step function sup- 
ported in the interior of that light cone. cr(x, x') is one-half the squared geodesic 
distance from x' to x, such that it vanishes when a null geodesic connects the two 
points. 

Erom this starting point, the final result is obtained by a two-step process: (i) 
evaluating the integrals over Tq by changing the integration variable to a, and (ii) 
writing the retarded distance from v' to v in terms of the Eermi radial coordinate r. 
The results for the two terms in Eq. (221) are 



8m 






(j Dz\ 1 ^ , 8m • n 

Wn + O(r^), 



dri 



0 



(224) 



where z\ = ^ , and 



4m 






2m 



• livU' 



“o “o = ~^z‘{na5^i,u + Oir'^). 



(225) 



The components in Eermi coordinates are 
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5h\, = + 0(r°), (226a) 

5h]^ = + £)(r°), (226b) 

Kb = ^^^^Sab + 0 (r°). (226c) 

Here we see that with the identification = mz\, the tt and ab components 
are precisely those in Eq. (105), and the ta component is precisely the term pro- 
portional to W in Eq. (106). In other words, the integral (224) corresponds to the 
mass dipole moment’s contribution to the monopole seed zo, Sm), while the 

integral (225) corresponds to the mass-dipole seed zo, ^)- 



Appendix 3: Identities for Gauge Transformations 
of Curvature Tensors 

Let A[g] be a tensor of any rank constructed from a metric g. (To streamline the 
presentation, I adopt index-free notation throughout most of this appendix.) Now 
define 



1 

6"A[fu ^ —A[g + Ai/i + ■ ■ • + A„/„]| (227) 

This tensor is linear in each of its arguments f\, . . . , fn\ it is also symmetric in 
them. In the case that all the arguments are the same, we have 5^ A[h, . . . = 

^ ^A[g -h A/i] the piece oi A[g -\-h] containing precisely n factors of h and 
its derivatives. 

The following identities are easily proved by writing Lie derivatives as ordinary 
derivatives: 



C^A[g] = 5A[C^g]^ (228) 

\C\A[g] = \5A[C\g^ + S'^A[C^g, C^g], (229) 

C^5A[h\ = 5A[C(h] + 25^A[C^g, h], (230) 

Note that 5^A[C^g, h] = 5^A[h, C^g] = \ {5^A[h, C^g] + 5^A[C^g, h]). As an 
example, if A is the Ricci tensor, then 

£5 (231) 

\C\r^A 9] = \5R^AC\g] + 5^R„AC^g. C^g], (232) 

C^5R^,Ah^ = 5R^,AC^h] + IS^R^Ah, £ 55 ], (233) 

where I have restored indices to avoid confusion with the Ricci scalar. 



Motion of Small Objects in Curved Spacetimes . . . 



483 



To establish Eq. (228), one can write the metric as a function of a parameter A 
along the flow generated by ^ and then perform a Taylor expansion: 



AAM = 






A=0 



= SA 



A=0 



dX 



A=0 



= 5A[C^gl (234) 



Similarly, to establish Eq. (229), one can write 






. dq 

5(0) + 

dX 



, 1^2 dg 

A=0 + dX^ 



A=0 



A=0 



= \5A[Clg]^5^A[C^gXig], 



(235a) 

(235b) 



and to establish Eq. (230), one can write ^ as a function of parameters (A, e), where 



h = 



— ^1 



de le=0 



, and then write 






dXde 



g(X, 0) + e^(A, 0) 
de 



\=e=q 



dXde 



d^g 



5(0, 0) + A^(0, 0) + e^(0, 0) + Ae 

dX de dXde 



( 0 , 0 ) 



5A[C^h]+25^A[h,C^g]- 



(236a) 



A=e=0 

(236b) 
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Self-force: Computational Strategies 



Barry Wardell 



Abstract Building on substantial foundational progress in understanding the effect 
of a small body’s self-field on its own motion, the past 15 years has seen the emergence 
of several strategies for explicitly computing self-field corrections to the equations 
of motion of a small, point-like charge. These approaches broadly fall into three 
categories: (i) mode-sum regularization, (ii) effective source approaches and (iii) 
worldline convolution methods. This paper reviews the various approaches and gives 
details of how each one is implemented in practice, highlighting some of the key 
features in each case. 



1 Introduction 

Compact-object binaries are amongst the most compelling sources of gravitational 
waves. In particular, the ubiquity of supermassive black holes residing in galactic 
centres [1] has made the extreme mass ratio regime a prime target for the eLISA 
mission [2-6] . Meanwhile, the comparable and intermediate mass ratio regimes are 
an intriguing target for study by the imminent Advanced LIGO detector [7]. In order 
to maximise the scientific gain realised from gravitational- wave observations, highly 
accurate models of gravitational- wave sources are essential. For the case of extreme 
mass ratio inspirals (EMRIs) — binary systems in which a compact, solar mass object 
inspirals into an approximately million solar mass black hole — the demands of grav- 
itational wave astronomy are particularly stringent; the promise of groundbreaking 
scientific advances — including precision tests of general relativity in the strong-field 
regime [8-10] and a better census of black hole populations — hinges on our ability 
to track the phase of their gravitational waveforms throughout the long inspiral, with 
an accuracy of better than 1 part in 10,000 [2]. This, in turn requires highly accurate, 
long time models of the orbital motion. 
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For the past two decades, these demands have stimulated an intense period of 
EMRI research among the gravitational physics community. Despite of the impres- 
sive progress made by numerical relativists towards tackling the two-body problem 
in general relativity (for reviews see Refs. [11-13]), the disparity of length scales 
characterising the EMRI regime is a significant roadblock for existing numerical rel- 
ativity techniques. Indeed, to this day EMRIs remain intractable by current numerical 
relativity methods, and successful approaches have instead tackled the problem per- 
turbatively or through post-Newtonian approximations (see Ref. [14] for a review). 
This article will focus on the first of these; by treating the smaller object as a per- 
turbation to the larger mass, the so-called “self-force approach” reviewed here has 
been a resoundingly successful tool for EMRI research. 

Within the self-force approach, the smaller mass, /i ^ M, is assumed to be 
sufficiently small that it may be used as a perturbative expansion parameter in the 
background of the larger mass, M. Expanding the Einstein equation in /i, we see 
the smaller object as an effective point particle generating a perturbation about the 
background of the larger mass. At zeroth order in /i, the smaller object merely follows 
a geodesic of the background. At first order in /i, it deviates from this geodesic due 
to its interaction with its self-field. Viewing this deviation as a force acting on the 
smaller object, the calculation of this self-force is critical to the accurate modelling of 
the evolution of the system. For the purposes of producing an accurate waveform for 
space-based detectors, and for producing accurate intermediate mass ratio inspiral 
(IMRI) models, it will be necessary to include further effects up to second perturbative 
order [15, 16]. Indeed, recent compelling work [17, 18] suggests that IMRIs — and 
even comparable mass binaries — may be modelled using self-force techniques. 

A naive calculation of the first order perturbation due to a point particle leads 
to a retarded field which diverges at the location of the particle. The self-force, 
being the derivative of the field, also diverges at the location of the particle and one 
obtains equations of motion which are not well-defined and must be regularized. A 
series of formal derivations of the regularized first order equations of motion (now 
commonly referred to as the MiSaTaQuWa equations, named after Mino et al. [19] 
and Quinn and Wald [20] who first derived them) for a point particle in curved 
spacetime have been developed [19-30], culminating in a rigorous work by Gralla 
and Wald [31] and Pound [32] in the gravitational case and by Gralla et al. [33] 
in the electromagnetic case. This was subsequently extended to second perturbative 
order by Rosenthal [34-37], Pound [38-40], Gralla [41] and Detweiler [42]. These 
derivations eliminate the ambiguities associated with the divergent self-field of a 
point particle and provide a well-defined, finite equation of motion. Building upon 
this foundational progress, several practical computational strategies have emerged 
from these formal derivations: 

• Dissipative self -force approaches: While the full first-order self-force is divergent, 
it turns out that the dissipative component is finite and requires no regularization. 
This fact has prompted the development of methods for computing the dissipa- 
tive component alone, sidestepping the issue of regularization altogether. These 
dissipative approaches fall into two categories: 
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1 . Flux methods: By measuring the orbit- averaged flux of gravitational waves onto 
the horizon of the larger black hole and out to infinity, the fact that the field is 
evaluated far away from the worldline means that no divergent quantities are ever 
encountered. This approach yields the time- averaged^ dissipative component of 
the self-force [43-50]. 

2. Local/instantaneous dissipative self-force: The time averaging element of flux 
methods can be eliminated by instead computing the local, instantaneous dissi- 
pative component of the self-force from the half-advanced-minus-half-retarded 
field [43, 51-53]. 



Both methods, however, fundamentally rely on neglecting potentially important 
conservative effects which can significantly alter the orbital phase of the system. 

• The mode-sum approach: Introduced in Refs. [54, 55], and having since been 
successfully used in many applications, the approach relies on the decomposition 
of the retarded field into spherical harmonic modes (which are finite, but not 
differentiable at the particle), numerically solving for each mode independently 
and subtracting analytically-derived “regularization parameters”, then summing 
over modes. 

• The effective source approach: Proposed in [56, 57], the approach implements the 
regularization before solving the wave equation. This has the advantage that all 
quantities are finite throughout the calculation and one can directly solve a wave 
equation for the regularized field. 

• The worldline convolution approach: First suggested in [58, 59], one computes the 
regularized retarded field as a convolution of the retarded Green function along the 
past worldline of the particle. Although the approach is the most closely related to 
the early formal derivations, it is only recently that it has been successfully applied 
to calculations in black hole spacetimes. 



For a comprehensive review of the self-force problem, see Refs. [60-63] . In this paper, 
I will review the various approaches and give details of how each one is implemented 
in practice, highlighting the advantages and disadvantages in each case. 

This paper follows the conventions of Misner et al. [64] ; a “mostly positive” metric 
signature, (— , -h, +, +), is used for the spacetime metric, the connection coefficients 
are defined by = ^^'^{gan,v + 9cny,n ~ the Riemann tensor is R'^xtiu = 

the Ricci tensor and scalar are R„/3 = 

and R = and the Einstein equations are Gap = Rap ~ \9apR = ^T^Tap. 
Standard geometrized units are used, with c = G = 1 . Greek indices are used for 
four-dimensional spacetime components, symmetrisation of indices is denoted using 
parenthesis [e.g. (ct/^)], anti-symmetrisation is denoted using square brackets (e.g. 
[a(3]) and indices are excluded from symmetrisation by surrounding them by vertical 
bars [e.g. {a\P\^)]. Latin letters starting from i are used for indices summed only 
over spatial dimensions and capital letters are used to denote the spinorial/tensorial 
indices appropriate to the field being considered. Either v or are used when 



^For the case of inclined orbits in Kerr spacetime, this is more appropriately formulated as a 
torus-average. 
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referring to a spacetime field point and z(r) or z^(r) are used when referring to a 
point on a worldline parametrised by proper time r. Finally, a retarded (or source) 
point is denoted using a prime, i.e. z'. 



2 Equations of Motion 

The formal equations of motion of a compact object moving in a curved spacetime are 
now well established up to second perturbative order. Writing the perturbed spacetime 
in terms of a background plus perturbation, ga /3 = +haf], the equations of motion 

essentially amount to those of an accelerated worldline in the background spacetime, 
with the acceleration given by a well-defined regular field which is sourced by the 
worldline. To order /i, this coupled system of equations for the worldline and its 
self-field are commonly referred to as the MiSaTaQuWa equations and are given (in 
Lorenz gauge, assuming a Ricci-flat background spacetime) by 

= -167T// J ga'{aU‘^' 9I3)I3'U^' h{x,z{T'))dT (la) 

^ia^ = (lb) 



with 



Here, /i is mass of the object, g^^'^ is the bivector of parallel transport, Ca^jd is 
the Weyl tensor of the background spacetime, and we use the trace-reversed metric 
perturbation ha/s = hap - h = 

One can also consider compact objects possessing other types of charge. For 
example, a particularly simple case is that of a scalar charge q with mass m and 
scalar field O, in which case the equations of motion are given by^’^ 

(□ — = —Airq J z{T'))dr' (2a) 

ma^ = q (2b) 

^ = (2c) 



^In the scalar case, it is important to distinguish between the self-force and the 

self-acceleration, which is given by projecting to self-force orthogonal to the worldline, = 
^9(0) F u*^u9)Fp. In the electromagnetic and gravitational cases the self-force has no component 
along the worldline and the two may be used interchangeably. 

^We assume that the mass m is small and ignore its effect on the equations of motion. 
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Here, R is the Ricci scalar of the background spacetime and ^ is the coupling to 
scalar curvature. Similarly, for an electric charge, e, one obtains equations of motion 
which are given in Lorenz gauge by 

= -4ne j Qaa'U^^' ^4ix, z{T'))dT (3a) 

ma" = (3b) 

where is the Ricci tensor of the background spacetime and is the vector 
potential. 

The key component in all instances is the identification of the appropriate regu- 
larized field on the worldline. Detweiler and Whiting identified a particularly elegant 
choice for the regularized field, written in terms of the difference between the retarded 
field and a locally-defined singular field, 

^ 4 = = ( 4 ) 

In addition to giving the physically-correct self-force, the Detweiler- Whiting regular 
field has the appealing feature of being a solution of the homogeneous field equations 
in the vicinity of the worldline. Most computational strategies essentially amount to 
differing ways of representing this singular field^ and obtaining the regularized field 
on the worldline. 



3 Numerical Regularization Strategies 



In a numerical implementation, it is essential to avoid the evaluation of divergent 
quantities. In the case of self-force calculations, both the retarded and singular fields 
diverge on the worldline so one must avoid evaluating them there. Several strategies 
for doing so have emerged over the years (see Table 1 for a summary). 

One option is to only ever evaluate finite, dissipative quantities (e.g. the retarded 
field far from the worldline or the half-advanced-minus-half-retarded field on the 
worldline). This is the basis of the dissipative methods mentioned in the introduction. 
Since these methods effectively avoid the problem of regularization, they will not be 
discussed further here; we will return to them in Sect. 4. It is worth noting, however, 
that the methods typically used by dissipative calculations are essentially the same as 
those used by mode-sum regularization for computing the retarded field, but without 
the additional regularization step. 

This leaves three regularization strategies which allow the regularized field to be 
computed on the worldline without encountering numerical divergences: worldline 
convolution, mode-sum regularization, and the effective source approach. 



^Some methods [65, 66] rely on alternative prescriptions for the singular field than that proposed 
by Detweiler and Whiting. 
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Table 1 Summary of regularization methods employed by self-force calculations in black hole 
spacetimes 



Case 


Worldline 


Mode- sum 


Effective source 


Scalar 


Schwarzschild 


Circular (apprx) [59]; 
generic (quasilocal) 
[67, 68]; 
generic [69-71]; 
static [72]; 
accelerated [73] 


Radial [74]; 
circular [75-78]; 
eccentric [79-83]; 
static [72] 


Circular [56, 57, 65, 
84-86]; eccentric [87]; 
evolving [88] 


Kerr 


Generic [68]; 
accelerated [73] 


Circular [89]; 
equatorial [90, 91]; 
inclined circular [92] ; 
accelerated [93]; 
static [94, 95] 


Circular [96]; 
eccentric [97] 



EM 



Schwarzschild 


Static [72] 


Static [72]; 
eccentric [82, 98]; 
static (Schwarzschild- 
de Sitter) [99]; 
radial (Reissner- 
Nordstrom) [100] 




Kerr 


- 


Equatorial [90]; 
accelerated [93] 


- 



Gravity 



Schwarzschild 


Generic 

(quasilocal) [101] 


Radial [102]; 
circular [103-111]; 
eccentric [82, 
112-120]; 
osculating [121] 


Circular [122] 


Kerr 


Circular 


Equatorial [90]; 


Circular [125]; 




(quasilocal) [59]; 


accelerated [93]; 


generic [126] 




branch cut [123] 


circular [119, 124] 





3.1 Worldline Convolution 



The worldline convolution method relies on a split of the regularized self-force into 
an “instantaneous” piece and a history-dependent term. The instantaneous piece is 
easily calculated from local quantities evaluated at the particle’s position, 



(^mst 

,a 



+ {dal + 



+ UaUfs) 

= K%,u-u^ = 0. 



(5a) 

(5b) 

(5c) 
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Fig. 1 Schematic 
representation of the 
worldline convolution 
method. The equations 
shown are for the case of the 
self-force on a scalar charge, 
but are representative of 
similar equations in the 
electromagnetic and 
gravitational cases. 
Reproduced from Ref. [69] 



Timelike worldline of the partiele 
Current loeation of the partiele - z(x) 

Integral in quasiloeal region 

f V„Gret{z{T),z{T'))dT' 

JTm 

Matehing point -z(Tm) 



Boundary of eausal domain where 
Hadamard form is valid 

^ First null geodesie whieh 
re-interseets the worldline 

Integral outside quasiloeal region 

f V‘"Gret{z(T),Z 

J —oo 



The history-dependent term is much more difficult to calculate, as it is given in terms 
of a convolution of the derivative of the retarded Green function along the worldline’s 
entire past-history (see Fig. 1), 

= q r V^G’^^^[x,z{T')]dT\ (6a) 

J — OO 

= e r V^G-‘,[x, dr', (6b) 

J—OO 

r dr'. (6c) 

J—OO 

The covariant derivatives here are taken with respect to the first argument of the 
retarded Green function. Likewise, the regularized self-field can be obtained from 
a worldline convolution of the retarded Green function itself; similar formulae can 
also be derived for higher derivatives. The retarded Green function appearing in these 
equations is a solution of the wave equation with a delta-function source, 

(□ - ^/?)G''^‘ = -47 t 6 ‘^{x, x'), 

~ = -4n Qaa'S‘^(x,X^), 

^G^da'P' J p, = -4 tt gaa'gi3i3’S^(x,x'), 



(7a) 

(7b) 

(7c) 
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with boundary conditions such that the solutions correspond to purely outgoing 
radiation at infinity and no radiation emerging from the horizon. 

The regularization (i.e. subtraction of the Detweiler- Whiting singular field) is 
formally achieved by the limiting procedure in the upper limit of integration, i.e. 
by cutting off the integration at r“, slightly before the coincidence point z = x. 
In practice, this is done by taking the integration all the way up to coincidence, but 
excluding the direct contribution to the Green function at coincidence (i.e. the term 
proportional to (a) in the Hadamard form for the Green function). Although the 
retarded Green function also diverges at certain other points along the past worldline 
(in particular at null-geodesic intersections where the particle sees null rays emitted 
from its past), it turns out that these are all integrable singularities (of the form 1/cr 
and (5(cr)) and the integral may be accurately evaluated to give a finite and accurate 
value for the regularized self-force. 

Despite having been proposed in the early days of EMRI self-force calculations 
[58], the worldline convolution method was largely ignored for a long time by numer- 
ical implementations (the notable exception being investigative studies [59, 127] 
which did not complete a full calculation of the self-force). A likely reason is that 
the method relies on knowledge of the retarded Green function for all points on the 
particle’s past worldline. While methods have existed for decades for computing 
portions of the retarded Green function, it turns out to be very difficult to obtain it 
accurately everywhere that it is needed for the worldline convolution. 

Thankfully, recent progress has led to two practical computational strategies, 
both of which have been successfully applied to compute the self-force in black 
hole spacetimes. The first of these is based on a frequency-domain decomposition of 
the Green function, and builds on the rich history of black hole perturbation theory 
developed over the past several decades. The second, a time-domain approach, has 
been a very recent development and shows a great deal of promise for the future. 

Independently of whether a frequency-domain or a time-domain scheme is used, 
a common problem with both is that they fail at early times when source and field 
points are close together; in the frequency-domain case the convergence is poor at 
early times, while in the time-domain case features from the numerical approximation 
pollute the data at early times. A relatively straightforward solution which has been 
successfully applied in both scenarios is to only rely on their results at late times 
and to supplement them at early times with a quasilocal Taylor series expansion. 
Provided a sufficiently early time can be chosen where both the distant past and 
quasilocal calculations converge, this yields a global approximation for the Green 
function which is sufficient for producing an accurate result for the self-force. To 
date this has been shown to be possible in the case of a scalar charge in Nariai [128], 
Schwarzschild [69, 70] and Kerr [129] spacetimes. 

Given an approximation for the Green function valid throughout the past world- 
line, it is then trivial to numerically integrate Eq. (6) to obtain the self-force (see 
Eig. 1). As mentioned previously, the divergences in the retarded Green function at 
null-geodesic intersections on the past worldline are all integrable singularities and 
do not pose a significant obstacle to accurate numerical evaluation of the integrals. 
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3.1.1 Quasilocal Expansion 

In order to obtain an approximation to the retarded Green function which is valid at 
early times, it is convenient to start with the Hadamard form for the Green function 
(in Lorenz gauge) 



Gfs’ix, x') = @-[uAB'5{a) - Vab'@{-ct)\ (8) 

where ©_ is analogous to the Heaviside step-function, being 1 when x' is in the causal 
past of V, and 0 otherwise, 6(a) is the covariant form of the Dirac delta function, 
Uab' and Vab' are symmetric bi- spinors/tensors and are regular for x' ^ x. The bi- 
scalar a (v, x^) is the Synge world function, which is equal to one half of the squared 
geodesic distance between v and x'. In particular, a(x, x) = 0 and a(x, z) < 0 
when X and z are timelike separated. Because of the limiting procedure in the history 
integral, Eq. ( 6 ), only the term involving Vab' is non-zero, and at all required points 
along the worldline ©_ = 1 = ©(—a). The problem of determining the retarded 
Green function at early times therefore reduces to finding an approximation for 
Vab'(x, -^0 which is valid for x and x' close together. 

Several methods have been developed for computing approximations to Vab'- 
Fundamentally, they rely on either the use of a series expansion, or on the use 
of numerically evolved transport equations (ordinary differential equations defined 
along a worldline). The series expansion approach has been the most fruitful to date 
with results including: leading-order coordinate expansions in Schwarzschild and 
Kerr spacetimes for scalar [ 68 , 73] and gravitational cases [101], high-order coor- 
dinate expansions in spherically symmetric spacetimes (including Schwarzschild) 
[67], formal covariant expansions in generic spacetimes [130-133], and moderately 
high-order coordinate expansions in Schwarzschild [82] and Kerr [90] spacetimes. 
The only numerical calculation I am aware of was done in [133] for generic space- 
times (with an example application in Schwarzschild spacetime). 

The series expansion method produces an expression for V(x, xO as a power 
series in the coordinate distance between x and x^ For example, for the scalar case 
in Schwarzschild spacetime it takes the form 

00 

V(x,x')= ^ Vijk(r) (t - cos (9) 

i,j,k=0 

where 7 is the angular separation of the points and the Vijk are analytic functions 
of r and M. It is straightforward to take partial derivatives of these expressions at 
either spacetime point to obtain the derivative of the Green function. Although this 
series on its own may be sufficient for use in the quasilocal component of a world- 
line convolution, it turns out that some simple tricks allow for a vast improvement in 
accuracy. It turns out that, because V (x, xO diverges at the edge of the normal neigh- 
bourhood, the series approximation benefits significantly from Fade resummation 
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which incorporates information about the form of the divergence [67]. One minor 
caveat is that since Fade re-summation is only well defined for series expansions in 
a single variable, it is necessary to first expand r' and 7 in a Taylor series mt — 
using the equations of motion to determine the higher derivatives appearing in the 
series coefficients. Then, with V{x,x') written as a power series m t — t' alone, 
a standard diagonal Fade approximant provides an accurate representation of the 
Green function in the quasi-local region. 



3.1.2 Frequency Domain Methods 

Frequency domain methods for computing the retarded Green function rely crucially 
on the separability of the wave equation. In the scalar, Schwarzschild case that rep- 
resents the current state-of-the art^ [69] (also see [128] for a related calculation in 
Nariai spacetime), this can be achieved by writing the Green function as a sum of 
spherical harmonic and Fourier modes 



^ Y rOQ+ie 

= — ^/’«(cos7)— / gdr,r'- (10) 



Here, e > 0 is a formal parameter to ensure the correct boundary conditions are satis- 
fied for a retarded Green function. Substituting this into the wave equation, Eq. (7a), 
one obtains an independent set of ordinary differential equations for gi(r, r'\uj), one 
equation for each i and uj, 

ge(r,r';oj) = -S(r^ - r'd, ( 11 ) 



/ 2M\r^(^ + l) 2M1 

V.(r) ^1 - . 

Here, r* = r -h 2M In — l) is the radial tortoise coordinate. 






^More generally, Teukolsky [134, 135] showed that the field equations may be separated in Ken- 
spacetime in the gravitational case by making use of the spin- weighted spheroidal harmonics in place 
of the spherical harmonics. A series of works — ^pioneered by Regge and Wheeler [136] and improved 
upon by others [137-143] — achieved a similar separation in the Schwarzschild gravitational case 
by making use of tensor spherical harmonics. However, separability alone is not sufficient and 
there remain some technical issues which have yet to be solved before solutions of the Teukolsky 
or Regge- Wheeler equations could be used in a worldline convolution approach. Most important 
is the issue of gauge; the MiSaTaQuWa equations, Eqs. (5) and (6), were derived in the Lorenz 
gauge, whereas solutions of the Teukolsky and Regge- Wheeler equations are in a gauge different 
from Lorenz gauge. Lortunately, recent work [119] has resolved many of the conceptual issues 
associated with gauge choice. 
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Given two linearly-independent solutions, p{r\uo) and q{r\uo), of the homoge- 
neous version of (11), gi(r, r'\ uo) is given by 



gi{r, r;u) 



W(p,q) 



(13) 



where r> = max(r, r'), r< = min(r, r') and W(p,q) is the Wronskian. One is then 
faced with the integral over frequencies in the inverse Fourier transform appearing 
in (10). This may be achieved through straightforward integration along the real-cc; 
axis (see Fig. 2); the only caveat is that the formally-infinite integral over frequencies 
should be cut off at some finite maximum frequency using a smooth window function, 
for example 



^ noo+ie ^ 

gi(r, r'\ t - t') = — / gi(r, r'\ - erf[2(w - Wmax)])/2c?w. 

Z7T y_oo+/e 



(14) 



Alternatively, as was done in [69], the integration contour can be deformed 
into the complex frequency plane following a proposal by Leaver [144, 145]. In 
Schwarzschild spacetime, ggir, uo) has simple poles in the lower semi-plane at 



Frequency domain Green function, /=0 mode ^ , x- , « ^ 

Time domain Green function, /=0 mode 





Frequency domain Green function, /=10 mode 




Time domain Green function, /=10 mode 




Fig. 2 Spherical harmonic modes of the Schwarzschild scalar Green function. Left the frequency 
domain Green function, gi(r, r'; uj), as a function of real frequency for r = r' = lOM, 1 = 0 
{top) and I = 10 (bottom). Right the corresponding time domain Green function, gi(r, r'\ t — t') 
computed by Fourier transforming the frequency domain Green function using Eq. (14) with o;niax = 
8.5 (blue, solid line), and by using the time domain methods described in Sect. 3.1.3 (orange, 
dashed line) 
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Fig. 3 Contour deformation in the complex-frequency plane. The residue theorem of complex 
analysis allows one to re-express the integral over (just above) the real line of the Fourier modes, 
gi(r, r'; a;), as an integral over a high-frequency arc plus an integral around a branch cut and a sum 
over the residues at the poles. Reproduced from Ref. [69] 



the quasinormal mode frequencies, along with a branch cut starting at cj = 0 and 
continuing along the negative imaginary axis (see Fig. 3). The deformed frequency 
integral is therefore given by an integral over a high-frequency arc, an integral around 
a branch cut, and an integral over the residues at the poles. ^ 

The high-frequency arcs can be disregarded as they are likely to only contribute 
at very early times, where the quasilocal approximation can be used. There are 
well-established methods for accurately computing the location of the poles (qua- 
sinormal mode frequencies) and the residues at the poles. The biggest difficulty in 
the frequency-domain approach is in evaluating the branch cut integral, about which 
very little was known until recently (other than asymptotic approximations for e.g. 
large radius or late times). Substantial recent progress has established methods for 
calculating this branch cut contribution [146]; these methods were used in [69] to 
compute the self-force in Schwarzschild spacetime. 

The final step in the frequency domain approach is to sum over spherical harmonic 
modes to produce the full Green function. Here, the distributional parts of the full 
Green function can cause poor convergence in the mode-sum. Fortunately, there is 
a straightforward solution to this problem: by smoothly cutting off the mode sum at 
large f , 



-I ^-max 

x') = — - ^ Pi>(cos'f)gi{r, r'; 1 - 
£=0 



(15) 



one obtains a mollified retarded Green which is appropriate for use in a self-force 
calculation, and whose sum over I converges. Empirically, it has been found that 
choosing fcut ~ ^max/5 gives good results. 



^In the Kerr spacetime there are indications that there may be additional branch cuts to consider 
[123]. 
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3.1.3 Time Domain Approaches 

The frequency domain approach to computing the retarded Green function has several 
shortcomings. It relies on relatively difficult technical calculations and has poor 
convergence properties at early times and in scenarios where the worldline is not 
well-represented by a discrete spectrum of frequencies. Recent work has shown that 
the Green function may also be accurately computed (at least for the purposes of self- 
force calculations) in the time domain using straightforward numerical evolutions. 
A time domain calculation sidesteps issues related to a wide frequency spectrum and 
appears to exhibit much better convergence properties at early times. 

There are two closely-related proposals for computing the Green function in the 
time domain. In [7 1] , Eq. (7a) was numerically solved as an initial value problem, with 
the delta-function source approximated by a narrow Gaussian. A reformulation of 
this approach as a homogeneous problem with Gaussian initial data was subsequently 
given in [70]. It was found that these “numerical Gaussian” approaches are able to 
approximate the retarded Green function remarkably well in a large region of the 
space required by worldline convolutions. The size of the Gaussian limits the scale of 
the smallest features which can be resolved, but it turns out that this is not significantly 
detrimental to a self-force calculation. 

The only regimes where the numerical Gaussian approach is not well suited to 
computing the retarded Green function are at very early and very late times. At early 
times the direct (i(cr(v, x)) term (which must be excluded from the worldline convo- 
lution of the retarded Green function) is smeared out and is difficult to isolate from 
the rest of the Green function. There is no conceptual difficulty at late times, but the 
reality of a numerical evolution is that it can only be run for a finite time. Fortunately, 
both of these issues are easily overcome; the former by using the quasilocal expan- 
sion at early times, and the latter by using a late-time expansion of the branch cut 
integral (see Fig. 4). 

In the time domain approach, each numerical time domain evolution gives the 
Green function G(xo, x') for a single base point xq, and for all source points, x. As 
a result, a single numerical calculation can only be used to compute the self-force 
at a single spacetime point, but it can be computed for any past- worldline ending up 
at that point. This is in contrast to other self-force methods, where a single orbit is 
considered at a time, but a single calculation gives the self-force at all points on the 
orbit. The problem of efficiently spanning the parameter space of base points xq is 
an ideal application of reduced order methods [129, 147]. 



3.2 Mode-Sum Regularization 

The mode-sum regularization scheme, first proposed by Barack and Ori [54], 
has proven highly successful as a computational self-force strategy, having been 
applied to the computation of the the self force for a variety of configurations in 
Schwarzschild [74-81, 83, 102, 107-110, 113, 114, 148, 149] and Kerr [89, 91, 
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Schwarzschild retarded Green function along an eccentric geodesic 





0 50 100 150 200 250 300 



Fig. 4 Top Matched Green function along an eccentric orbit (eccentricity e = 0.5 and semilatus 
rectum p = 7.2) in Schwarzschild spacetime. The full Green function {black) is constructed by 
combining a quasilocal approximation at early times {orange, dashed line) with a time domain 
calculation at intermediate times {blue, dashed line) and a late-time branch cut approximation at 
late times {green, dashed line). Bottom Integrating the retarded Green function (excluding the direct 
part) up to some time point on the past worldline gives the contribution to the regularized self-field 
from all points on the past worldline up to that time. With an eccentric orbit of period ~3 17M, we 
see that a good approximation to the self-field is obtained by including the contributions from less 
than one full orbit. Note that the formal divergences of the Green function are integrable and do not 
cause significant numerical difficulty in computing the self-force 

92, 124, 150] spacetimes. The success of the method hinges on the fact that the first 
order retarded field diverges in a way which can be effectively smoothed out by a 
spectral decomposition in the angular directions. More specifically, the divergence 
appears as an odd power of \/s, where s'^ = {g^^ + is an appropri- 

ate measure of distance from the worldline. The result is that the 1 /s divergence of 
the field near the worldline turns into an infinite sum of modes, each of which are 
individually finite (but possibly discontinuous) on the worldline. The divergence of 
the field then manifests itself through the failure of the (infinite) sum over modes 
to converge. Conveniently, this odd-in- 1/^- property of the retarded field also holds 
for its derivatives, both the first derivatives required for the self-force and higher 
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derivatives which are useful for computing higher-multipole gauge invariants [104, 
105]. As a result, an arbitrary number of derivatives of the first order retarded field 
is represented by an infinite sum of modes, each of which are individually finite 
(but possibly discontinuous) on the worldline. The trade-off is that as the number 
of derivatives increases the divergence of the sum over modes becomes increasingly 
strong. 

Since the individual modes are finite, numerical calculations of the retarded field 
and its derivatives can be done in the reduced {t,r) space without encountering any 
numerical divergences. The remaining piece of the problem is a method for render- 
ing the divergent sum over modes finite. An analytic decomposition of the angular 
dependence of the Detweiler- Whiting singular field yields “regularization parame- 
ters” which may be subtracted mode-by-mode from the numerical retarded field 
values. Provided all parts of the Detweiler- Whiting singular field which don’t vanish 
on the worldline are included in the calculation, the regularized sum over modes is 
convergent on the worldline and one never encounters any numerical divergences. 

It is important to note that the use of the Detweiler- Whiting singular field is not 
merely a convenience; it also provides well-motivated physical grounds to justify the 
mode-sum regularization approach. Other ad-hoc approaches based on identifying 
the asymptotically divergent contributions to the mode- sum often produce the correct 
regularization parameters, but their use is difficult to justify rigorously and can easily 
lead to incorrect results. Crucially, there is no way of knowing whether such an ad-hoc 
regularization procedure is producing a physically correct result, or if important 
contributions are being overlooked. They should therefore not be relied on without 
extreme care. 

Unfortunately, despite its success in first order calculations, mode-sum regular- 
ization alone is not an effective tool for computing the second order self-force. The 
reason for this is straightforward: the second order retarded field contains divergent 
terms which appears in the form of even powers of l/^-. Intuitively, this arises from 
the fact that the second order field contains terms which depend quadratically on the 
first order field. Unlike the odd-in- 1/^- case, the angular decomposition oi\/s^ leads 
to individual modes which diverge logarithmically as the worldline is approached. 
Fortunately, all is not lost for the mode-sum method as a computational tool at sec- 
ond order. Provided the leading order logarithmic divergence is subtracted by some 
other means (for example, using the effective source approach), regularization para- 
meters may be used to accelerate the rate of convergence of the mode sum. Such a 
hybrid scheme complements the generality of the effective source approach with the 
computational efficiency of mode-sum regularization. 



3.2.1 Regularization Parameters 

The computation of regularization parameters has been addressed by a series of 
calculations stemming from the original Barack-Ori derivation. Barack and Ori’s 
original work gave the first two self-force regularization parameters for scalar, elec- 
tromagnetic and Lorenz-gauge gravitational cases in both Schwarzschild and Kerr 
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spacetimes. These are sufficient for computing the regularized self-force, but they 
yield mode- sums which have relatively poor quadratic convergence with the num- 
ber of modes included. Since the Detweiler- Whiting regularized field is a smooth, 
homogeneous function in the vicinity of the worldline, its mode-sum representation 
converges exponentially. This spectral convergence is spoiled by the fact that only 
the portion of the Detweiler- Whiting singular field which does not vanish on the 
worldline is subtracted by the leading order regularization parameters. The mode 
decomposition effectively contains information about the extension of the field off 
the worldline to the entire two-sphere, so the regularized field contains residual pieces 
of the Detweiler- Whiting singular field off the worldline, but on a two-sphere of the 
same radius. By deriving higher-order regularization parameters one can subtract 
these residual pieces order-by-order, leaving a mode- sum which is more and more 
rapidly convergent (see Fig. 5). The derivation of these higher-order parameters is 
closely related to the computation of the quasilocal expansion of the Green function 
and has been addressed in a series of papers: the first higher order parameter was 
given in [76] for the case of a scalar charge on a circular orbit in Schwarzschild 
spacetime, and for eccentric geodesic orbits in [80]. This was subsequently extended 
by several further orders for equatorial geodesic motion in the scalar, electromagnetic 
and gravitational cases in both Schwarzschild [82] and Kerr [90] spacetimes. Recent 
work has also produced parameters for accelerated worldlines [72, 93]. 

Within these derivations of high-order regularization parameters, a subtle ambi- 
guity appears through the elevation of the four- velocity from a quantity defined 
on a worldline to a quantity defined everywhere on the two- sphere. A natural, 
covariant choice is to define this off- worldline extension through parallel transport, 
u^. However, in practical calculations it is often convenient to make a 
coordinate choice. For example, a common choice is to define the extension in terms 
of “constant coordinate components”, i.e. to define such that its components in 
some coordinate system have a constant value everywhere on the two-sphere. This is 




Fig. 5 Spherical harmonic modes of the self-force for a point scalar charge on a circular orbit of 
radius ro = 6M in Schwarzschild spacetime. By subtracting analytically determined high-order 
“regularization parameters”, the sum over modes is rendered more and more rapidly convergent. 
Each regularization parameter, , behaves asymptotically for large-£ as 1 /£" 
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a perfectly valid choice, as is any other choice where agrees with the actual four- 
velocity when evaluated on the worldline. The only caveat is that the regularization 
parameters beyond the leading two orders change depending on the particular choice 
of extension. As a result, in order to use higher-order regularization parameters it is 
essential that a compatible choice of off-worldline extension is used in the retarded 
field calculation. 



3.2.2 Choice of Basis and Gauge 

There are two other factors which must be considered in the mode-sum scheme: the 
choice of angular basis functions for the spectral decomposition and the choice of 
gauge in the electromagnetic and gravitational cases. 

The choice of basis functions is typically motivated by their ability to produce 
separability of the retarded field equations; an appropriate choice of basis functions 
yields an independent set of equations in (t, r) space for each individual mode. For 
the case of a scalar charge in Schwarzschild spacetime, the appropriate basis func- 
tions are the standard spherical harmonics. In the Kerr case the spheroidal harmonics 
are required for separability. For electromagnetic and gravitational cases, there are 
several choices. For Schwarzschild spacetime one can choose between a tensor har- 
monic basis and a basis of spin- weighted spherical harmonics. In the Kerr case only 
the spin- weighted spheroidal harmonics are known to yield separability [134, 135]. 

This choice of angular basis also affects the regularization parameters. The para- 
meters for scalar spherical harmonic modes, tensor harmonic modes, spin- weighted 
spherical harmonic modes and spheroidal harmonic modes are all potentially differ- 
ent. However, it is always possible project the tensor, spin- weighted, or spheroidal 
harmonic modes onto the scalar spherical harmonic basis. In fact, since the regular- 
ization parameters are most easily obtained for the scalar spherical harmonic basis, 
calculations of the retarded field are typically done using the computationally most 
convenient basis and the result is then projected onto the scalar spherical harmonics 
before the regularization and sum-over-modes steps are done [113]. 

A more difficult issue in the electromagnetic and gravitational cases is the choice 
of gauge. The Detweiler- Whiting singular field is defined in Lorenz gauge (since it 
is derived from a Lorenz gauge Green function), but numerical calculations of the 
retarded field are more easily done in either radiation or Regge- Wheeler gauge. The 
existence of tensor spherical harmonics makes a Lorenz-gauge calculation possible, 
if somewhat cumbersome in the Schwarzschild case [111, 118, 120]. One obtains 
a coupled set of 10 equations for the metric perturbation, but there is no coupling 
between different different tensor-harmonic modes. Unfortunately, this does not hold 
for the Kerr case as there are no known tensor spheroidal harmonics. Rather than 
trying to derive tensor spheroidal harmonics, a better approach is to work with the 
relatively straightforward Teukolsky equation in radiation gauge [124] . The difficulty 
then is in identifying the appropriate regularization parameters, particularly since the 
gauge transformation from Lorenz gauge is itself often singular. This remained an 
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Open problem until recently, when an understanding of how to apply the mode- sum 
regularization scheme in radiation gauge was finally established [119] and applied 
in a self-force calculation [108]. 



3.2.3 Mode-Sum Regularization in the Frequency Domain 

The mode-sum scheme provides a method for computing the regularized self-force 
using numerical data for the modes of the retarded field in the reduced (r, r) space. 
There are, however, several possibilities for computing this numerical data. One 
option relies on a further decomposition of the time dependence into Fourier- 
frequency modes, in an analogous way to the frequency-domain Green function 
described in Sect. 3.1.2 above. Using a spin-weighted spheroidal harmonic basis, 
this leads to a radial equation for the Teukolsky function, 

A-^^(A^+l^) + y(r)/? = T(r) ( 16 ) 

with the potential given by 



V{r) = 



- 2is(r - M)K 
A 



Airuos — A. 



(17) 



Here, A = — 2Mr K = (r^ -h a^)uo — am, X is an eigenvalue of the 

spheroidal equation. 



- 1 d 
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sin^ 0 — 
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2au)s cos 6* + ^ + 2mau) + 



= 0 



( 18 ) 



and a and M are the Kerr spin and mass parameters. For a = 0, A = — 

and this reduces to the Schwarzschild wave equation decomposed into spin-weighted 
spherical harmonics (in radiation gauge for the gravitational case), while for 5' = 0 
it reduces to the scalar wave equation. A decomposition of the point-particle source 
term yields a source involving 6(r — rp) (and in some cases its derivative), where 
Vp is the radial location of the particle. In practice, the frequency domain mode- 
sum approach proceeds in the same way as for the Green function: two independent 
solutions of the homogeneous radial equation are obtained and are matched at the 
particle’s location.^ Then, the distributional sources do not introduce any numerical 
difficulty as they simply appear as jumps when matching the homogeneous inner and 
outer solutions. 



^For eccentric orbits where the particle can not be considered to be at a single radial location in 
the frequency domain this matching must be modified slightly using, for example, the method of 
extended homogeneous solutions [112, 116]. 
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The solutions of the Teukolsky equation for a given {I, m,uj) can be obtained 
either through straightforward numerical integration of the radial ordinary differen- 
tial equation (typically with some modifications to improve numerical accuracy [151, 
152]) or as an approximation in the form of an infinite convergent series of hyperge- 
ometric functions. The latter method is based an idea originally developed by Leaver 
[153] and now commonly referred to as the “MST” method, after Mano et al. [154] 
who reformulated it into its current form. It provides an efficient and highly accu- 
rate method for computing solutions of the radial equation. For example, recent 
results have used it to compute solutions accurate to several hundred decimal places, 
allowing the solutions to be used to determine previously unknown high-order post- 
Newtonian parameters [155-159] . For a comprehensive review of the MST approach, 
see the living review by Sasaki and Tagoshi [160] and references therein. 

The frequency-domain approach is particularly appropriate in scenarios where 
the worldline is well represented by a discrete spectrum involving a small number of 
frequencies. In such cases, mode- sum regularization is by far the optimal choice and 
is unparalleled in its accuracy and computational efficiency [161]. The prototypical 
example is a circular orbit, in which only a single frequency is present. In the case of 
eccentric equatorial orbits (and inclined circular orbits in the Kerr case), there are two 
fundamental frequencies, and also an infinite number of higher harmonics produced 
from combinations of the fundamental frequencies. For mildly eccentric orbits this 
does not cause a great deal of difficulty. However, for more eccentric orbits (with 
eccentricities ^ > 0.5) an increasingly large number of frequencies must be included 
and the competitive advantage of the frequency domain approach is lost [161]. Even 
worse, generic geodesic orbits in Kerr spacetime have three fundamental frequencies 
and the computational difficulty is so high that a calculation has yet to be attempted. 
Similarly, unbound orbits and other cases such as radial infall are not well suited 
to frequency domain methods. Apart from these deficiencies, the frequency domain 
mode-sum approach has been highly successful for producing results. 



3.2.4 Mode-Sum Regularization in the Time Domain 

The mode- sum scheme may also be applied in the time domain by skipping the 
Fourier decomposition step and instead solving a set of 1 -h ID partial differential 
equations in (t, r) space. The main difficulty then is in appropriately handling the 
distributional source term which has the form (i(r — rp (t)). One solution, used in [49, 
79, 98, 113, 149, 162], is to use a discretised representation of a delta function and 
to construct the computational grid such that the worldline only ever passes between 
grid points. 

An alternative approach is to reformulate the problem in an analogous way to 
the frequency-domain method. By splitting the computational domain into two 
domains — one either side of the particle — the delta-function source can be refor- 
mulated in terms of a jump in the fields and their derivatives at the interface of the 
two domains. This method is well suited to highly-accurate spectral numerical meth- 
ods as all of the numerically evolved fields are smooth functions. It was implemented 
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using discontinuous-Galerkin methods in [106, 115] and using Chebyshev pseudo- 
spectral methods in [78, 81]. Eccentric orbits present a small additional complexity 
in this case as the particle must always lie on the domain interface. This is easily 
achieved by introducing a time-dependent mapping between the computational and 
physical coordinates of the system [81, 106]. 



3.2.5 Limitations of the Mode-Sum Regularization Scheme 

Despite its resounding success to date, the mode-sum regularization scheme has some 
unfortunate disadvantages which make it ill-suited as a general-purpose method for 
self-force calculations: 

1 . Its application in the Kerr case is only straightforward in the frequency domain, 
since the field equations are not separable in the time domain^ 

2. It relies on the use of Lorenz gauge for regularization in the gravitational case. 
This is not a major issue in the Schwarzschild case since the tensor spherical 
harmonics may be used to decouple the Lorenz gauge field equations in the 
angular directions (leaving 10 coupled 1 -h ID equations for each I, m mode). 
However, there are no know tensor spheroidal harmonics which would be required 
for the Kerr case, and even if there were they would likely not be applicable 
in the time domain (again, it is conceivable that a coupled system of equations 
involving tensor spherical harmonics could be used, but the coupling would result 
in considerable complexity). 

3. It is not applicable beyond first perturbative order, since the modes of the second 
order perturbation diverge logarithmically near the worldline. 

The first two issues are not necessarily showstoppers. There have been several 
attempts at 1 -h ID time-domain implementations using coupled spherical-harmonic 
modes in the Kerr case [163, 164], and recent progress on reformulating mode-sum 
regularization for radiation gauge has clarified the regularization issue [119]. The 
third point, however, appears insurmountable. Lor these reasons, among others, the 
effective source method, described in the next section, was developed. 



3.3 Effective Source Approach 



Proposed in 2007 as a solution to the shortcomings of mode-sum regularization, the 
effective source approach^ provides an alternative method for handling the diver- 
gence of the retarded field. Rather than first computing the retarded field and then 



^It may still be possible to use mode-sum regularization for the Kerr case in the time domain 
by decomposing the field equations into spherical harmonics and evolving the resulting infinitely 
coupled set of 1 + ID partial differential equations in a similar manner to the Schwarzschild case. 
^Note that the effective source proposed by Lousto and Nakano [65] is similar in spirit, but differs 
in that it is not derived from the Detweiler- Whiting singular field. 
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subtracting the singular piece as a post-processing step, one can instead work directly 
with an equation for the regular field. This idea — independently proposed by Barack 
and Golbourn [56] and by Vega and Detweiler [57] — has the distinct advantage of 
involving only regular quantities from the outset, making it applicable in a wider 
variety of scenarios than the mode-sum scheme. In particular, since it does not rely 
on a mode decomposition of the retarded field, it can be used by any numerical pre- 
scription for solving the retarded field equations, whether in the frequency domain 
(where the method is really just a generalisation of mode-sum regularization) or in 
the time domain as a 1 -h ID, 2 + ID or even 3 + ID problem. 

The basic idea is to use the split of the retarded field into regular and singular 
pieces, Eq. (4), to rewrite the field equations, Eqs. (la), (2a) and (3a) as equations for 
the Detweiler- Whiting regular field. 



If the singular field used in the subtraction is exactly the Detweiler- Whiting singular 
field, then the two terms on the right hand side of this equation cancel and the regular- 
ized field would be a homogeneous solution of the wave equation. Unfortunately, one 
typically does not have access to an exact expression for the singular field. Indeed, 
the Detweiler- Whiting singular field is only defined through a Hadamard parametrix 
which is not even defined globally. Instead, the best one can typically do is a local 
expansion which is valid only in the vicinity of the worldline. Borrowing the lan- 
guage of Barack and Golbourn, we refer to an approximation to the singular field 
as a “puncture” field, ~ ~ A^, ~ Then, the corresponding 

approximate regular field — referred to as the “residual field” — is no longer a solution 
of the homogeneous equation, but instead is a solution of the sourced equation with 
an effective source (see Eig. 6) which is defined to be the right-hand side of Eq. (19) 
with the puncture field substituted for the singular field 



(□ - = (□ - 



(19a) 



(nSa^ - R^)A^ = (nSc^ - R^KAf - Ap 

= -47re j gaa'U°"'SA(x, z(t'))cIt' - {U5a^ - Rp(Ap, (19b) 



= -167T// j ga'(aU°''gi3)l3'U^'S4(x, Z(T'))dT' - + 2CoXp^){hp). 

(19c) 




(20a) 



508 



B. Warden 




10 



110 



110 



Fig. 6 The effective source for a scalar particle on a circular orbit in Schwarzschild spacetime. 
The source is generically non- smooth in the vicinity of the worldline (left), but the smoothness can 
be improved by incorporating higher-order parts of the Detweiler- Whiting singular field into the 
source 



Note that the presence of a distributional component of the source on the worldline 
is merely a formal prescription; in practice the puncture field is chosen so that it 
exactly cancels this distributional component on the worldline. This effective source 
is then finite everywhere, but has limited differentiability on the worldline. This 
makes it well- suited to numerical implementations since no divergent quantities are 
ever encountered. The only numerical difficulty arises from the non-smoothness of 
the source in the vicinity of the worldline (see Fig. 6), which leads to numerical noise 
in the computed self force. The noise can be reduced by making the source smoother 
using higher-order parts of the Detweiler- Whiting singular field. As shown in Fig. 7, 
at the same numerical resolution a higher order source (C^ in this case) eliminates 
the vast majority of the numerical noise that is present when using a lower order 
source (C^ in the case in the figure). The cost of this improved accuracy is a source 
which is considerably more complicated, and costly to compute in a numerical code. 

An additional level of complexity arises from the fact that the puncture field is 
defined only in the vicinity of the worldline. To avoid ambiguities in its definition far 
from the worldline, one must ensure that the puncture field goes to zero there. This 
is most easily achieved by multiplying it by a window function, W, with properties 
such that it only modifies the puncture field in a way that its local expansion about 
the worldline is preserved to some chosen order. In a first-order calculation of the 
self-force, it suffices to choose W such that W(v^) = 1, W\xp) = 0, W^\xp) = 0 
and W = 0 far away from the worldline. The residual field (see Fig. 8) then obeys 




(□ - 

- R^)Af = Sf 



(21a) 

(21b) 

(21c) 
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Scalar self-force for e=0.5, p=7.2 geodesic 




X 

Fig. 7 Radial self-force for a scalar particle on an eccentric orbit in Schwarzschild spacetime, 
computed with a 3 -t- ID implementation of the effective source scheme. Here, the independent 
variable x is a “relativistic anomaly parameter” defined through r = pM/{\ + ^cosx), with 
p the semilatus rectum and e the eccentricity. The high-frequency errors using a continuous but 
non-differentiable source {blue) are dramatically decreased by using a twice differentiable source 
{orange) obtained from a higher-order approximation to the Detweiler- Whiting singular field. For 
reference, a highly accurate value computed using frequency-domain mode-sum regularization is 
also included {dashed, black). Figure based on version presented in Ref. [165] 



Fig. 8 The t = \,m = \ 
spherical-harmonic mode of 
the residual field, for a 
scalar point particle on a 
circular orbit in 
Schwarzschild spacetime. 
This was produced in [86] 
using a frequency-domain 
effective source approach 




and has the properties 

= Sa<^^(Xp), 

d)''^s(x) = for x^supp(>V), (22a) 



A’^^^xp) = Alixp), V^A^\xp) = VaAliXp), 
= Afix) for X ^ supp(W), 



(22b) 



510 



B. Warden 



Kl^^) = ^ ^ supp(W). (22c) 

As the residual field coincides with the retarded field far from the particle we 
can use the usual retarded field boundary conditions when solving Eqs. (21a)-(21c). 
The details of a numerical implementation of the effective source approach are then 
much the same as for the Green function and mode- sum regularization schemes. 
One can use either a frequency domain or a time domain method for solving the field 
equations, the key differences now being that there is no restriction to 1 -h ID, and 
that the effective source must be included as a source term. A more thorough review 
of the effective source approach — including a detailed description of methods for 
obtaining explicit expressions for the effective source — can be found in Refs. [126, 
166]. 

The effective source approach has been successfully applied in the frequency 
domain [86], and in the time domain in 1 -h ID [57], 2 -h ID [84, 96, 122] and 3 -h ID 
[85, 87, 88] contexts. It has also been formulated — but not yet implemented — in the 
gravitational case to second order in the mass ratio [40, 41]; for the second order 
problem it is currently the only viable computational strategy. Since the effects of 
the second-order metric perturbation will be very small — being suppressed by two 
orders of the mass ratio relative to test body effects — it is likely that a highly accurate 
numerical scheme will be required, suggesting a frequency domain treatment of the 
problem where one encounters ordinary differential equations (ODEs) which are 
relatively easy to solve numerically to high accuracy. 



4 Evolution Schemes 

The calculation of the self-force is only the first stage in the production of an inspiral 
model. Another critical component is a scheme for evolving the orbit using this self- 
force information. Various approaches have been proposed, each of which brings 
with it its own advantages and disadvantages. Approaches which make approxima- 
tions in the self-force used to drive the inspiral can give substantial decreases in 
the computational cost of an inspiral simulation, but come at the cost of ignoring 
potentially relevant physical effects. 



4.1 Dissipation Driven Inspirals 



A straightforward model for the inspiral can be obtained from energy balance consid- 
erations. Using a flux calculation — in which fluxes of energy and angular momen- 
tum are obtained by evaluating the point-particle retarded field near the horizon 
and out at infinity — the entire issue of regularization is avoided and one obtains 
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Fig. 9 Radial {left) and time (right) components of the self-force through one radial cycle, for 
three different geodesic orbits in Schwarzschild spacetime. Solid lines indicate the full self-force 
and dashed lines indicate the conservative-only (left) or dissipative-only (right) pieces. Arrows 
denote the direction of the geodesic motion. Reproduced from Ref. [87] 



an approximation to the contributions to the inspiral coming from dissipative self- 
force effects. However, this is inadequate for capturing all relevant effects from the 
first-order self-force. Being a dissipative approximation, it completely misses all 
conservative corrections to the motion. Furthermore, there is no well-defined way of 
associating fluxes far from the worldline with an instantaneous local self-force on 
the worldline; a flux model can only be used to drive an inspiral in a time-averaged 
sense, ignoring some potentially important dissipative contributions. Nevertheless, 
the relative straightforwardness and computational efficiency of their implementa- 
tion have made flux models a compelling approach to assessing qualitative features 
of self-force driven inspirals. These “kludge” models have been used to produce 
kludge waveforms for EMRI systems which capture at least some of the qualitative 
physical effects [44, 45, 48, 49, 162]. 

To improve on the flux-based dissipative model, one can instead make use of a 
half-retarded minus half-advanced scheme [43, 51-53] to compute a local, instanta- 
neous dissipative self-force (see Fig. 9). This approach captures all dissipative effects 
responsible for driving the inspiral, but still neglects small but potentially important 
conservative corrections to the orbital phase of the system [121]. 



4.2 Osculating Self-Forced Geodesics 

In order to account for both conservative and dissipative effects from the first order 
self-force, it is essential to build an inspiral model around a full calculation of the 
local self-force. This, however, still leaves some flexibility in the choice of method 
for computing this local self-force. One option, based on the osculating geodesics 
framework [167] is particularly compelling as it allows for dramatic improvements 
in computational efficiency by separating the coupled problem of simultaneously 
solving for the orbit and for the regularized retarded field into two independent. 
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largely uncoupled computational problems. The basic idea is to perturbatively expand 
the worldline about a geodesic of the background spacetime, z(r) = zo(^) + * * * • 
Then, the self-force at first order is a functional not of the evolving orbit z(r), but 
of the geodesic orbit, zo(^), which is instantaneously tangent to the worldline. This 
expansion is valid in the adiabatic regime, where the orbit is evolving slowly and the 
difference between the geodesic and evolving orbits is small; the error introduced 
by the approximation appears at the same order in the equations of motion as a 
second-order perturbative correction to the field equations. 

The improvements in computational efficiency brought about by the osculating 
geodesics approximation are dramatic. The problem of self-consistently computing 
the regularized self-force coupled to an arbitrarily evolving orbit is reduced to the 
much simpler problem of determining the self-force for geodesic worldlines. Since 
orbits of black hole spacetimes are parametrised by at most three conserved quantities 
(energy and angular momentum in the Schwarzschild case, supplemented by the 
Carter constant in the Kerr case), it is computationally tractable to span the entire 
parameter space of moderately-eccentric geodesic orbits. Even better, the highly- 
accurate frequency domain mode- sum method can be used because bound geodesic 
orbits are efficiently represented by a small frequency spectrum. In Ref. [121] this 
approach was explored over an entire radiation-reaction timescale for moderately- 
eccentric inspirals in the Schwarzschild spacetime. This was achieved by tabulating 
the values for the self-force in a relevant portion of the energy-angular momentum 
phase space of geodesic orbits, and using an interpolated model of the tabulated 
results to drive an orbital inspiral. 



4.3 Self-Consistent Evolution 

Unfortunately, the adiabatic approximation responsible for the dramatic improve- 
ments in computational efficiency brought by the osculating geodesics framework 
also introduces errors in the equation of motion at second perturbative order, making 
the method inadequate for the purposes of precision EMRI astrophysics. One pos- 
sible solution, implemented in [88] for the scalar field case, is to avoid expanding 
out the worldline and instead evolve the self-consistent coupled system of equations, 
Eqs.(la) and (lb); (2a), (2b) and (2c); or (3a) and (3b). This is a computation- 
ally much more difficult problem as the field equations must be solved in the time 
domain and one cannot rely on an efficient off-line tabulation of self-force values. 
This self-consistent evolution can in principal be implemented using a time-domain 
mode-sum scheme, but in practice implementations have instead used the effec- 
tive source approach since that will be necessary at second perturbative order (see 
Sect. 3.2 for an explanation of this issue). 



^®See also Ref. [168] for an approximate version which assumes a sequence of quasi-circular orbits. 
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While a self-consistent evolution incorporates all effects contributing to the first 
order self-force, and only neglects second order contributions from the second order 
field equations, this comes at the cost of computational efficiency. Whereas an oscu- 
lating geodesics framework can evolve a large number ("^10,000) of orbits with ease 
once the initial off-line tabulation phase is complete 1121], the self-consistent scheme 
requires a long calculation of the solutions of the first-order field equations for each 
new orbit. In reality, this limits the practicality of the scheme to tens of orbits using 
current methods. The self-consistent evolution scheme is therefore most valuable as 
a benchmark against which other, less accurate but more efficient methods can be 
validated. In fact, comparisons between the self-consistent and osculating geodesics 
scheme for the scalar case indicate that the osculating geodesics scheme performs 
remarkably well 1169]. 



4.4 Two-Timescale Expansions 



The osculating geodesics scheme is fast, but inaccurate, while the self-consistent 
evolution scheme is accurate, but slow. This begs the question of whether there is a 
middle ground which incorporates most of the accuracy of a self-consistent evolution 
while maintaining the computational efficiency of the osculating geodesics scheme. 
One promising possibility is that the use of a two-timescale expansion could give 
just such a scheme. In the two-timescale scheme, rather than using an expansion 
about a background geodesic (which is valid over short timescales characterised by 
the orbital motion), one introduces an additional radiation reaction timescale into the 
problem and incorporates the relevant effects over this radiation reaction timescale. 

The relevant two-timescale expansion of the worldline equations of motion was 
completed in Ref. 152], and follow-up work has improved our understanding of 
important resonance effects for inspiral orbits 153, 170-179] . In order to consistently 
incorporate a two-timescale expansion into an orbital evolution scheme, it will also 
be necessary to have a two-time expansion of the field equations. Promising progress 
towards such an expansion was recently reported in 1180], indicating that it is likely 
that a self-consistent orbital evolution using a two-timescale expansion is indeed 
feasible. 



5 Discussion 

The techniques described in this review represent the current state of the art of self- 
force calculations. The three primary approaches: worldline convolutions, mode-sum 
regularization and the effective source approach can be considered complimentary, 
with each having regimes where they are most appropriate: 



514 



B. Warden 



• The mode-sum scheme gives unparalleled accuracy (particularly in the frequency 
domain) for orbits which have a small frequency spectrum. 

• The worldline convolution method provides valuable physical insight and can be 
easily applied to arbitrary orbital configurations, including those inaccessible by 
other means. 

• The effective source approach can be used in arbitrary spacetimes without relying 
on any symmetries, and also stands out as the most applicable to a second order 
calculation. 

There still remain important developments to be made in each case. For example, 
the mode sum and effective source schemes are still under development for the Kerr 
gravitational case, and worldline convolution approaches have yet to be fully applied 
to the gravitational case for any spacetime. 

Finally, it should be pointed out that this review is not an exhaustive exposi- 
tion of all self-force computation strategies. Many other calculations have not been 
described, including alternative regularization strategies [66, 181, 182], near-horizon 
waveform calculations [183-186], methods based on effective field theory [187- 
190], analytic calculations in particularly simple cases [95, 191, 192], black holes 
in higher dimensions [193], and calculations in non-black hole spacetimes such as 
wormholes [194, 195] and cosmological models [196, 197]. More details can be 
found in the reviews [60, 61], and references therein. 
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On the Self-force in Electrodynamics 
and Implications for Gravity 



Volker Perlick 



Abstract We consider the motion of charged point particles on Minkowski 
spacetime. The questions of whether the self-force is finite and whether mass renor- 
malisation is necessary are discussed within three theories: In the standard Maxwell 
vacuum theory, in the non-linear Born-Infeld theory and in the higher-order Bopp- 
Podolsky theory. In a final section we comment on possible implications for the 
theory of the self-force in gravity. 



1 Introduction 

The problem of the electromagnetic self-force has a long history. It began in the 
late 19th century when Lorentz, Abraham and others tried to formulate a classical 
theory of the electron. The idea was to model the electron as an extended, at least 
approximately spherical, charged body and to determine the equations of motion 
for the electron. Based on earlier results by Lorentz, Abraham succeeded in writing 
the equation of motion in terms of a power series with respect to the radius of the 
electron. If the radius tended to zero, i.e., for a point charge, an infinity occurred. 
The reason for this infinity is in the fact that, in the point-particle limit, the electric 
field strength diverges so strongly at the position of the charge that the field energy 
in an arbitrarily small sphere becomes infinitely large. To get rid of this infinity, it 
was necessary to “renormalise the mass” of the particle by assuming that it carries a 
negative infinite “bare mass”. After this mass renormalisation, one got a differential 
equation of third order for the motion of the particle which is known as the Abraham- 
Lorentz equation. It is a non-relativistic equation in the sense that, on the basis of 
special relativity, it can hold only if the particle’s speed is small in comparison to the 
speed of light. 

A fully relativistic treatment of the problem had to wait until Dirac’s work [1] of 
1938. The resulting equation of motion is known as the Lorentz-Dirac equation or 
as the Abraham-Lorentz-Dirac equation. Clearly, everyone would call it the Dirac 
equation except for the fact that this name was already occupied by another, even 
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more famous equation. Neither Lorentz nor Abraham has ever seen the (Abraham-) 
Lorentz-Dirac equation, because both had passed away in the 1920s. In particular 
in the case of Abraham it is rather clear that he would not have liked this equation 
because he was an ardent opponent of relativity. Therefore, it seems appropriate 
to omit his name and call it the Lorentz-Dirac equation. For the derivation of the 
Lorentz-Dirac equation, again mass renormalisation was necessary and one arrived 
at a third-order equation of motion. The latter fact means that, in contrast to other 
equations of motion, not only the position and the velocity but also the acceleration 
of the particle has to be prescribed at an initial instant for fixing a unique solution. 
Moreover, the Lorentz-Dirac equation is notorious for showing unphysical behaviour 
such as run-away solutions and pre- acceleration. For a detailed discussion of the 
Lorentz-Dirac equation, including historical issues, we refer to Rohrlich [2] and to 
Spohn [3]. 

The trouble with the Lorentz-Dirac equation clearly has its origin in the fact that 
the electric field strength of a point charge becomes infinite at the position of the 
charge, and that this singularity is so strong that the field energy in an arbitrarily small 
ball around the charge is infinite. A possible way out is to modify the underlying 
vacuum Maxwell theory in such a way that this field energy becomes finite. Two such 
modified vacuum Maxwell theories have been suggested in the course of history, the 
non-linear Born-Infeld theory [4] and the linear but higher-order Bopp-Podolsky 
theory [5, 6]. It is the main purpose of this article to discuss to what extent these 
theories have succeeded in providing a theory of classical charged point particles 
with a finite self-force and a finite field energy. 

Some people are of the opinion that there is no need for a consistent theory of 
classical charged point particles. They say that either one should deal with extended 
classical charge distributions or with quantum particles. However, this is not convinc- 
ing. E.g. in accelerator physics it is common to describe beams in terms of classical 
point particles; neither a description in terms of extended charge distributions nor 
in terms of quantum matter seems to be appropriate or even feasible. Therefore, a 
consistent and conceptually well-founded theory of classical charged point particles 
is actually needed. 

The problem of the electromagnetic self-force of a charged particle has a coun- 
terpart in the gravitational self-force of a massive particle. In comparison with the 
electromagnetic self-force, the gravitational self-force is plagued with additional con- 
ceptual issues. The latter are related to the facts that Einstein’s field equation does not 
admit solutions for sources concentrated on a worldline, see Geroch and Traschen [7] , 
and that an extended massive particle becomes a black hole if it is compressed beyond 
its Schwarzschild radius. However, by considering the self-interacting massive par- 
ticle as a perturbation of a fixed background spacetime one arrives at a formalism 
which is similar to the electromagnetic case, see the comprehensive review by Pois- 
son et al. [8]. At this level of approximation it is reasonable to ask if modifications 
of the vacuum Maxwell theory can be mimicked by modifying Einstein’s theory in 
such a way that the (approximated) gravitational self-force becomes finite. We will 
come back to this question at the end of this article, after a detailed discussion of the 
electromagnetic case. 
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2 Maxwell’s Equations and the Constitutive 
Law for Vacuum 

Maxwell’s equations are universal and they do not involve a metric or a connection. 
They read 



dF = 0, dH = j, (1) 

where F is an untwisted two-form, if is a twisted two-form and j is a twisted three- 
form. (A differential form is twisted if its sign depends on the choice of an orientation. 
The difference between twisted and untwisted differential forms becomes irrelevant 
if the underlying manifold is oriented.) F gives the electromagnetic field strength, 
H gives the electromagnetic excitation and j gives the electromagnetic current. Our 
notation follows Hehl and Obukhov [9] . 

The Eqs. (1) are referred to as the premetric form of Maxwell’s equations. These 
equations immediately imply that on simply connected domains F can be represented 
in terms of a potential. 



F = dA, (2) 

and that charge conservation is guaranteed, 

dj = 0. (3) 

If j is given. Maxwell’s equations must be supplemented with a constitutive law 
relating F and H to specify the dynamics of the electromagnetic field. There is a 
particular constitutive law for vacuum, and there is a particular constitutive law for 
each type of medium. In any case, the constitutive law will involve some background 
geometry. In the following we consider vacuum electrodynamics on Minkowski 
spacetime. Then the constitutive law should involve the Minkowski metric tensor 
and no other background fields. 

On Minkowski spacetime, we may choose an orthonormal coframe, i.e., four 
linearly independent covector fields 0^,0^, 0^,0^ such that the Minkowski metric is 
represented as 



g = VabO“ ® 9^ (4) 

where (rjab) = diag(— 1, 1, 1, 1). Here and in the following we use the summation 
convention for latin indices that take values 0, 1, 2, 3 and for greek indices that 
take values 1, 2, 3. Latin indices will be lowered and raised with pab and its inverse 
respectively, while greek indices will be lowered and raised with the Kronecker 
symbol and its inverse 5^^ , respectively. 
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With respect to the chosen orthonormal coframe, the electromagnetic field strength 
can be decomposed into electric and magnetic parts, 

F = (5) 

Here the wedge denotes the antisymmetrised tensor product and is the Levi- 
Civita symbol, defined by the properties that it is totally antisymmetric and satisfies 
5^23 = 1. The electromagnetic excitation can be decomposed in a similar fashion, 

= + (6) 

If we apply the Hodge star operator of the Minkowski metric to F and H, we find 

*F = -5^0^A0O + lF%^^0^A0^ (7) 

*H = A 6»° - a r . (8) 

The field energy density measured by an observer whose 4-velocity V satisfies 
0^^{V) = 0 for /i = 1, 2, 3 is given by 

£= + (9) 

With the help of the Hodge star operator we can form out of F the untwisted 
scalar invariant 



\F A *F) = (10) 

and the twisted scalar invariant 

%F aF) = -2E^B^. (11) 

All these equations are valid with respect to any orthonormal coframe. In partic- 
ular, we may choose a holonomic coframe, i.e., we may choose inertial coordinates 
on Minkowski spacetimes. 



g = (8) dx^ (12) 

and then write 0^ = dx^ . In the following we will see that it is sometimes convenient 
to work with an anholonomic orthonormal coframe on Minkowski spacetime. 

We will now discuss the vacuum constitutive law in three different theories. 
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2.1 Standard Maxwell Vacuum Theory 



In the standard Maxwell theory, the constitutive law of vacuum reads 



H = *F. 



(13) 



By comparison of (6) and (7) we see that this implies 



DP = EP, HP = BP. 



(14) 



Here and in the following, we use units making the permittivity of vacuum, sq, the 
permeability of vacuum, and thus the vacuum speed of light, c = 
equal to one. 



2.2 Born-Infeld Theory 

In 1934, Bom and Infeld [4] suggested a non-linear modification of the vacuum 
constitutive law. 



where Z? is a new hypothetical constant of nature with the dimension of a (magnetic 
or electric) field strength. The idea behind this modified constitutive law is to find a 
theory where the field energy of a point charge remains bounded. We will discuss in 
the following sections to what extent this goal was achieved. 

Maxwell’s equations with the Born-Infeld constitutive law (15) can be derived 
from a Lagrangian that depends only on the invariants (10) and (11). This demon- 
strates that the theory is not only gauge invariant but also Lorentz invariant. However, 
we will not need the Lagrangian formulation in the following. 

As the constitutive law (15) does not involve any derivatives, in the Bom-Infeld 
theory the vacuum Maxwell equations are still of first order with respect to the field 
strength (i.e., of second order with respect to the potential), just as in the standard 
Maxwell theory. However, they are now non-linear. 

Obviously, the Born-Infeld constitutive law (15) approaches the standard vacuum 
constitutive law (13) in the limit b ^ oo. This implies that the Born-Infeld theory is 
indistinguishable from the standard Maxwell vacuum theory if b is sufficiently big. 
In this sense, any experiment that confirms the standard Maxwell vacuum theory is 
in agreement with Bom-Infeld theory as well, and it gives a lower bound for b. For 
the purpose of this article, the specific value of b is irrelevant as long as it is finite. 




H = 



(15) 



*{F A *F) {*{F A F)f 
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Decomposing the constitutive law (15) into electric and magnetic parts results in 



DP 



n ErB^ ^ 



1 + - EfjE' 






(16) 



= 



ErB^ 

B^^ -^E>^ 



1 , , {E^B'^y 

l + -g^{BaB^ -E^E^)- ^ ’ 



b^ 



(17) 



2.3 Bopp-Podolsky Theory 

Another modification of the vacuum constitutive law, again motivated by the wish 
of having the field energy of a point charge finite, was brought forward in 1940 
by Bopp [5]. The same theory was independently re-invented two years later by 
Podolsky [6]. The Bopp-Podolsky theory is equivalent to another theory that was 
suggested in 1941 by Lande and Thomas [10]. 

The Bopp-Podolsky vacuum constitutive law reads 

H = *E-e^D*E (18) 



where 



□ = *J*J + J*J* (19) 

is the wave operator on Minkowski spacetime and f is a new hypothetical constant of 
nature with the dimension of a length. In contrast to the Born-Inf eld constitutive law, 
the Bopp-Podolsky constitutive law is linear. However, it involves second derivatives 
of the field strength, so Maxwell’s equations give a system of fourth-order differen- 
tial equations for the potential A. In the Lande-Thomas version of the theory one 
splits the potential into two parts each of which satisfies a second-order differential 
equation, see Sect. 4.3 below. Just as the Bom-Infeld theory, the Bopp-Podolsky can 
be derived from a gauge-invariant and Lorentz-invariant Lagrangian (see Bopp [5] 
or Podolsky [6]) but we will not use the Lagrangian formulation in the following. 

For f ^ 0, the Bopp-Podolsky constitutive law (18) approaches the standard 
vacuum law (13). So any experiment that is in agreement with the standard Maxwell 
theory is in agreement with the Bopp-Podolsky theory as long as I is sufficiently 
small. However, dealing with the limit i ^ 0 requires some care because it is a 
singular limit of Maxwell’s equations in the sense that it kills the highest-derivative 
term. 
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3 Field of a Static Point Charge 

It is our goal to discuss the field of a point charge in arbitrary motion on Minkowski 
spacetime (subluminal, of course) according to the standard Maxwell vacuum theory, 
the Born-Infeld theory and the Bopp-Podolsky theory. As a preparation for that, it 
is useful to consider first the simple case of a point charge that is at rest in the 
spatial origin of an appropriately chosen inertial coordinate system. (Obviously, in 
any other inertial system the charge is then in uniform and rectilinear motion.) In this 
inertial system, the field produced by the charge must be spherically symmetric and 
static because there are no background structures that could introduce a deviation 
from these symmetries. Writing r — (v^ v^) for the coordinates and E — 

{E^ etc. for the fields in the chosen inertial system. Maxwell’s equations 

(1) reduce to 

Vx£ = 0, V-B=0, VxW = 0, V-5 = ^(5<2^(r), (20) 

where q is the charge and 5^^'^ is the three-dimensional Dirac delta distribution. While 
the curl equations are satisfied by any spherically symmetric E and H fields, the 
divergence equations determine the spherically symmetric D and B fields uniquely, 

D=-^er, B=Q, (21) 

47rr^ 

where 6r is the radial unit vector. So whatever the constitutive law may be, the electric 
excitation D always has its standard Coulomb form, i.e., it diverges like r~^ if the 
position of the charge is approached, and the magnetic field strength B vanishes 
everywhere. The corresponding (spherically symmetric) electric field strength E 
and magnetic excitation H are not restricted by Maxwell’s equations; they have to 
be determined from the constitutive law. 



3.1 Standard Maxwell Vacuum Theory 

In the standard Maxwell vacuum theory the constitutive law simply requires E = D 
and B = H. Hence, (21) says that E is the standard Coulomb field and that H 
vanishes. 



E = n = 0 . (22) 

47T^Z 

Clearly, |^| becomes infinite at the origin, i.e., at the position of the charge. The 
direction of E is always radial, so in the limit r ^ 0 the direction may be any unit 
vector depending on how the origin is approached. As these direction vectors average 
to zero, the Lorentz force {^E) exerted by the static particle onto itself vanishes. Here 
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we follow the widely accepted hypothesis that the self-force results from averaging 
over directions, cf. e.g. Poisson et al. [8]. This hypothesis is very natural if one thinks 
of the point particle as being the limiting case of an extended (spherical) body. 

The field energy in a ball Kr of radius R around the origin is 

f 1 ^ ^ 9 n n dr 

W(R)= -D ■ Erhin'ddrd'ddif = — -r. (23) 

Jkr 2 Stt Jo 

Clearly, this expression is infinite, for arbitrarily small R. Both E and D throw in a 
factor of one of them is killed by a factor of from the volume element but 
the other one makes the integral diverge. We see that we can cure this infinity by 
introducing a modified constitutive law that leaves E bounded if the position of the 
charge is approached. We will now verify that both the Bom-Inf eld theory and the 
Bopp-Podolsky theory have this desired property. 



3.2 Born-Infeld Theory 



As 5 = 0 by (21), in the Born-Infeld theory the constitutive law requires 



b = 




n = o. 



(24) 



With D given by (21), we have to solve the equation 




(25) 



for E to determine the electric field strength. The result is (Bom and Infeld [4]) 



E = 



47ryr4 + r4 



er. 



2 _ ^ 



(26) 



Hence \E\ Z? for r ^ 0, see Fig. 1. Note that the limit of ^ for r ^ 0 does 
not exist because the direction of the limit vector depends on how the position of 
the point charge is approached. One may say that the electric field strength stays 
bounded but has a directional singularity at the origin. By averaging over directions, 
the self-force (~^) of the static particle vanishes. 
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Fig. 1 Modulus of the 
electric field strength for a 
static charge in the 
Born-Inf eld theory {solid) 
and in the standard Maxwell 
vacuum theory {dashed) 



E 




The field energy in a ball Kr of radius R around the origin is 



W(R) = [ 
Jkr 



1 ^ ^ 

-D • Er^smddrd'ddip = — / 

2 ^7T Jo 



dr 



(27) 



This is an elliptic integral which is finite as long as tq > 0, i.e., as long as b is finite. 
Even the field energy in the whole space is finite, 



lim W{R) = 

R^oo 



g^r(5/4)^ 

2ro7T^/^ 



where T is the Euler gamma function. 



(28) 



3.3 Bopp-Podolsky Theory 



In this case the constitutive law requires 

D = E-£^AE, h = 0. 



(29) 



With D given by (21), we have to solve the second-order differential equation 



E -fAE 



a ^ 

47T^2 ^ 



(30) 



to determine E. Eor a spherically symmetric field, E{f) = £’(r)?^(r), this reduces 
to 



E - 




a 

47T^2 



(31) 
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The general solution is 

E = ^[l+Ci€(r-£)e''/^-C2€(r+£)e-''/^j?, (32) 

with two integration constants C\ and C 2 . The first integration constant is fixed if 
we require E to fall off towards infinity; this yields C\ = 0. The second integration 
constant is fixed if we require E to stay bounded if the position of the charge is 
approached; this yields C 2 = This gives us the Bopp-Podolsky analogue of the 
Coulomb E field (Bopp [5]; Podolsky [6]) 

^=4^I‘-G+‘K"'K 

which satisfies |^| ^ qK^irl^) for r ^ 0, see Fig. 2. Just as in the Bom-Infeld 
case, the electric field strength stays bounded but has a directional singularity at the 
origin. 

The field energy in a ball Kr of radius R around the origin is 

W(R) = [ -D- Er^sinMrdMip (34) 

Jkr 2 




which is finite as long as f > 0. As in the Born-Infeld theory, even the field energy 
in the whole space is finite. 



lim 

R^oo 



W(R) = 



87t£ 



(35) 



Fig. 2 Modulus of the 
electric field strength for a 
static charge in the 
Bopp-Podolsky theory 
{solid) and in the standard 
Maxwell vacuum theory 
{dashed) 



E 
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4 Field of an Accelerated Point Charge 

We have seen that both the Born-Infeld theory and the Bopp-Podolsky theory modify 
the Coulomb E field of a point charge at rest in such a way that | ^ | is bounded and 
that the field energy in a ball around the charge is finite. Of course, what one is really 
interested in is the field produced by an accelerated charge. We will now try to find 
out what can be said about this case. 

We choose an inertial coordinate system on Minkowski spacetime, g = r]abdx^ (g) 
dx^ . We fix a timelike curve z^(r) parametrised by proper time. 



We assume that this timelike curve is inextendible. As an accelerated worldline may 
reach (past or future) infinity in a finite proper time, this does not necessarily mean 
that the parameter r ranges over all of M. We denote the interval on which r is defined 
by iTmin, Tmax[ where -00 < Tmin < ^max < 00. 

We want to determine the electromagnetic field of a point charge that moves 
on the worldline z^(r). For convenience, we introduce an orthonormal tetrad 
(^o(^). ^3 ('?■)) along the worldline of the charged particle such that 



see Fig. 3. Along the worldline, this fixes the timelike vector ^o(^) everywhere and 
the spacelike vector ^ 3 (r), up to sign, at all events where the acceleration z{r) is 

Fig. 3 Retarded light-cone ^0 

coordinates and orthonormal 

coframe \ T 




(36) 



= z“{t), a{T)e^{T) = t{T), 



(37) 
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non-zero. e\ (r) and e 2 {r) are then fixed up to a rotation in the plane perpendicular 
to ^ 3 (r). At points where the acceleration is zero, e^ir) is ambiguous; there are 
pathological cases where it is not possible to extend it into such points such that the 
resulting vector field is continuously differentiable. We exclude such cases in the 
following and assume that the tetrad is smoothly dependent on r and satisfies (37) 
along the entire worldline. 

With respect to this tetrad, we can introduce retarded light-cone coordinates 
(r, r, if) which are related to the inertial coordinates (x^, x\x^, x^) by 



n^(r, i3, (p) = cosip sin'deiir) -h simp -h cos'de^ir). (39) 

Retarded light-cone coordinates are routinely used nowadays when treating self- 
force problems, cf. e.g. Poisson et al. [8]. These coordinates have a long history. In 
connection with electrodynamics on Minkowski spacetime, they were introduced by 
Newman and Unti [11] in 1963. In particular, Newman and Unti demonstrated that 
in these coordinates the Lienard-Wiechert potential takes a surprisingly simple form. 
In general relativity the history of light-cone coordinates is even older. They made 
their first appearance in a 1938 paper by Temple [12] who called the time-reversed 
version (i.e., the advanced light-cone coordinates) “optical coordinates”. Advanced 
light-cone coordinates are used in gravitational lensing and in cosmology where the 
wordline is interpreted as an observer who receives light (rather than as a source that 
emits radiation). 

In retarded light-cone coordinates, the “temporal” coordinate r labels the future 
light-cones with vertex on the chosen worldline; r is a radius coordinate along each 
light-cone and (i^, (p) are standard spherical coordinates that parametrise the spheres 
(r, r) = constant. Of course, there are the usual coordinate singularities of the 
spherical coordinates at the poles sin^ = 0 and (p is defined only modulo 27 t. If these 
coordinate singularities are understood, the system of retarded light-cone coordinates 
is well-defined on an open subset, U, which equals the causal future of the worldline 
with the worldline itself being omitted. Figure 4 shows a worldline that approaches 
the speed of light in the past. In this case the causal future of the worldline is bounded 
by a lightlike hyperplane to which the worldline is asymptotic. For a worldline that 
does not approach the speed of light in the past, the causal future is all of Minkowski 
spacetime. (Recall that we consider only wordlines that are inextendible.) 

With the retarded light-cone coordinates (r,r,'d,p) we can associate an ortho- 
normal coframe (0^ , 0^ , 0^ , 0^) defined by 




(38) 



where 
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Fig. 4 Domain of definition, 
U , of the retarded light-cone 
coordinates 




0^ = dr + dr ra(r)cosMT (40) 

0^ = dr ra{r)cosddT 
0^ = rdi^ — ra(T)sinddr 
0^ = rsinddip, 



see Fig. 3. 

The electromagnetic field of the point charge is to be determined by solving the 
Maxwell equations (1) with 

O PTmax \ 

- z{T))z‘'(T)dT\rjabdx^ (41) 

"Tmin / 

where is the 4-dimensional Dirac delta distribution. The solution has to satisfy 
the vacuum constitutive law on the open domain U and it should be retarded. By 
the latter requirement we mean that the field strength at an event x e U should be 
completely determined by what the point charge did in the causal past of the event x. 



4.1 Standard Maxwell Vacuum Theory 



In the case of the standard Maxwell vacuum theory, finding the field of a point 
charge on Minkowski spacetime is a standard text-book matter. The solution is F = 
JA, H — *F, where 




Arr 



47T 



<dr + dr 



“h d 



(r)cos'i^Jr^ 



r 



(42) 
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is the (retarded) Lienard-Wiechert potential. At an event x G [/, the potential is 
determined by the 4-velocity and the 4- acceleration of the point charge at the retarded 
time which is given by the coordinate r. There are no “tail terms”, i.e., there is no 
dependence on the earlier history of the point charge. 

For deriving the Lienard-Wiechert potential in a systematic way, one introduces 
the potential, F = dA, and imposes the Lorenz gauge condition, = 0. Then the 
first Maxwell equation, dF = 0, is automatically satisfied and the second Maxwell 
equation, dH = j, becomes an inhomogeneous wave equation for A, 

□A = (*J*J -h J*J*) A = *J*F = = * 7 . (43) 



With the well-known (retarded) Green function of the wave operator □, the (retarded) 
solution can be written as an integral over * 7 . Inserting the current from (41) gives 
the desired result. 

From the Lienard-Wiechert potential we find that the field strength F = dA and 
the excitation H = "^d A are given by 



and 



47t \ r / 

q /dr A dr 
47 t \ r 



-h a(r)smddi9 A dr 



) 



q / 0^ aO^ a{r) 



H = 

47T \ 



sini?6>2 A{0^ -e 



■') 



= — ^mddd A dip, 
47T 



(44) 



(45) 



respectively. Decomposing into electric and magnetic parts yields 



E^ei^ = D^ei^ = +a(r)sin^?Lj , 

o 0^ 

BnO^ — HnO^ = — a(r)sim9 — . 

47 t r 



(46) 

(47) 



In addition to the “Coulomb part”, which goes with 1/r^, we have in the case of a 
non- vanishing acceleration a “radiation part” which goes with 1/r. The self-force, 
i.e. the Lorentz force exterted onto the point charge by its own field, is given as 
the limit of qE/^Qf^ if the position of the point charge is approached. The Coulomb 
part averages to zero, as in the case of a static charge. The radiation part, however, 
does not average to zero; it gives an infinite self-force whenever the acceleration 
a(r) is non-zero. As in the static case, the field energy in an arbitrarily small sphere 
around the point charge is infinite. It is this infinite amount of energy carried by 
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the point charge with itself that makes mass renormalisation necessary if one wants 
to formulate an equation of motion for the point charge taking the self-force into 
account. 



4.2 Born-Infeld Theory 

If one wants to find the field of an accelerated point charge in the Born-Infeld theory, 
one would try to mimic the derivation of the Lienard-Wiechert potential as far as 
possible. As in the standard Maxwell theory, one can satisfy the first Maxwell equa- 
tion by introducing the potential and one can impose the Lorenz gauge condition (or 
any other gauge condition if this appears to be more appropriate). However, with H 
given in terms of F = dAhy the Born-Infeld constitutive law, the second Maxwell 
equation now becomes a non-linear inhomogeneous wave equation for A. There 
are no standard methods for solving such an equation; in particular. Green function 
methods are not applicable. Therefore, we cannot write down a Born-Infeld analogue 
of the Lienard-Wiechert potential. In the Bom-Infeld theory, no explicit solution of 
the electromagnetic field of a point charge with non- vanishing acceleration seems to 
be known. 

One might say that it is not actually necessary to write down a solution explicitly. 
It would be sufficient if one could verify some properties of the solution. Firstly, it 
would be highly desirable to prove that, for a point charge moving on an arbitrary 
worldline or on a worldline subject to some conditions, the retarded electromagnetic 
field is unique and regular on U . Secondly, it would be highly desirable to know if 
for this solution the self-force and the energy in a ball around the charge are finite. 
However, very little is known about these issues in the Bom-Infeld theory beyond 
the case of an unaccelerated point charge. 

As to regularity, it seems worth wile to point out that even for a time-independent 
and smooth j the question of regularity is highly non-trivial. It was shown only 
recently by Kiessling [13] that in this case the electromagnetic field is, indeed, free 
of singularities or discontinuities. Although this result seems to be intuitively quite 
obvious, the proof is difficult and very technical. It is based on series expansions 
with respect to 1/Z?^, where h is the Bom-Infeld constant, and the hard part is in the 
proof of convergence. For the field of an accelerated point charge, it is very well 
conceivable that infinities or discontinuities (“shocks”) are formed. It is true that 
Boillat [14] has shown the non-existence of some kind of shocks in the Bom-Infeld 
theory, but these results do not apply to the case at hand where the equations become 
singular along a worldline. 

Even if it is possible to show that the electromagnetic field of a point charge is 
regular on U, either for all worldlines or for a special class of worldlines, it is far 
from obvious that the field has the same behaviour as in the static case if the position 
of the point charge is approached. A discussion of related issues can be found in a 
paper by Chruscinski [15] ; this, however, is based on the assumption that the electric 
field strength remains bounded and that the electric excitation diverges like r~^ if 
the position of the point charge is approached. In contrast to the retarded light-cone 
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coordinates used here, Chruscihski used Fermi coordinates in a similar fashion as 
they had been used already earlier by Kijowski [16] in the context of the standard 
Maxwell vacuum theory. 

Something can be said, at least, for the case of a point charge that is initially at 
rest and then starts accelerating. In this case, conservation of energy guarantees that 
the total field energy must be finite for all times. However, even in this case it is not 
clear if shocks are excluded. 

For approaching the problem in a systematic way, one may write the electromag- 
netic field strength as a power series with respect to 1 

^ Fn Fi 

^=Z^ = ^o + :^ + ---’ Ff,=dAM. (48) 



Inserting this expression into the Born-Infeld constitutive law (15) and collecting 
terms of equal powers of I /b^ gives 

00 00 ^ 

^ = Z = Z ^ (^0, . . . , Fm-i)) (49) 

where Wa^(Fo, . . . , F^-i) stands for an expression depending on Fq, , F^-i 
that can be explicitly calculated for every N. We have to determine the = dA^ 
such that dH = j with the current given by (41). This can be done by requiring 

dHo = j, dHN = 0 for N = 1,2, ... (50) 

and solving these equations iteratively. We may impose the Lorenz gauge condition 
on each Then the zeroth order retarded solution is known to be the standard 
Lienard-Wiechert field, Fq = dAo with Aq given by the right-hand side of (42). The 
higher-order F^ = dA^ slvo determined by 

d{*dAN + WNidAo,...,dAN-i)) =0. (51) 

In the Lorenz gauge, this is the standard inhomogeneous wave equation for with 
the inhomogeneity given in terms of the lower-order solutions Aq, . . . , Aa^_i, 

□Ayv=*7w, Jn = -dWN[dAo,...,dAN-i). (52) 

The retarded solution of this equation is known from classical electrodynamics: It is 
the retarded potential of the “current” three-form jW- In this way, we can iteratively 
determine the A^ and write the solution F = JA as a formal power series. 

The big question, unanswered so far, is whether or not this series converges. We 
do know that it does converge in the case of vanishing acceleration; then we get 
the field of a static point charge discussed in Sect. 3.2. For non-zero acceleration, 
however, no convergence results are known. 
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4.3 Bopp-Podolsky Theory 



In the case of the Bopp-Podolsky theory the situation is much better than in the 
case of the Born-Inf eld theory. The Bopp-Podolsky theory is linear, so it allows for 
applications of the Green function method. 

With F = dA and choosing the Lorenz gauge, = 0, the remaining field 
equation reads 



□A - = *j . (53) 

This fourth-order equation for A can be reduced to a pair of second-order equations 
□A = *j, DA - r^A = j, (54) 

if we write 

A = A - A (55) 

A ;= A - l^UA, A := -t^DA. (56) 



If rewritten in this way, a quantised version of the theory would predict the existence 
of a massless photon, described by A, and a massive photon with Compton wave- 
length I, described by A. Both Bopp [5] and Podolsky [6] had realised that their 
higher-order theory can be rewritten in this way as a two-field theory. This two-field 
theory is precisely what Lande and Thomas [10] independently suggested one year 
after Bopp and one year before Podolsky. 

One can thus construct the (retarded) solution to the fourth-order equation (53) 
from the (retarded) Green functions of the wave equations (54). The latter are well 
known, see e.g. the original paper by Lande and Thomas [10]. This gives the retarded 
solution to (53) for the current (41) of a point charge as 



A(x) = 




7l(s(x, t')/^) .a 




Vabdx^ 



(57) 



where 



s{x, T'f = -(x" - z‘‘(T')){xa ~ Za(r')) (58) 

and Ji is the Bessel function of the first kind. The geometric meaning of 5'(v, r') is 
illustrated in Fig. 5. 

The potential (57) is the Bopp-Podolsky analogue of the Lienard-Wiechert poten- 
tial. In contrast to the standard Lienard-Wiechert potential, it depends on the entire 
earlier history of the point charge up to the retarded time r. Such “tail terms” are 
nothing peculiar; they also occur in the standard vacuum Maxwell theory on a curved 
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Fig. 5 s(x,t') is the 
Lorentz length of the 
timelike line segment that 
connects x with z(r') 




background, see e.g. Poisson et al. [8]. The integral in (57) and in the correspond- 
ing expression for the field strength can be expanded in a formal power series with 
respect to 1. For the self-force, after averaging over directions this results in a series 
with terms of order I, , see Zayats [17] (also cf. McManus [18], 

Frenkel [19] and Frenkel and Santos [20]). However, these series are non-convergent 
and, therefore, of limited use. 

So in contrast to the Born-Inf eld theory, in the Bopp-Podolsky theory the electro- 
magnetic potential (and, thereupon, the electromagnetic field strength) produced by 
an arbitrarily accelerated point charge can be explicitly written down, albeit in terms 
of an integral over the particle’s earlier history. A detailed discussion of the class of 
worldlines for which this integral absolutely converges will be given elsewhere [21]. 
This demonstrates that, for a large class of worldlines, the electric field stays bounded 
and there is no need for mass renormalisation. As an important example, the self-force 
of a uniformly accelerated point charge was calculated by Zayats [17]. 

Because of the tail terms, the equation of motion is no longer a differential equation 
but rather an integro-differential equation for the worldline. It is unknown if the 
equation of motion admits run-away solutions. For some partial results, indicating 
that run-away solutions cannot exist if i is bigger than a certain critical value, see 
Frenkel and Santos [20]. 



5 Implications for Gravity 

The preceding discussion can be summarised in the following way. In the standard 
Maxwell vacuum theory, the self-force is infinite and mass renormalisation is neces- 
sary. Postulating a negative infinite bare mass is conceptually not satisfactory and the 
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resulting equation of motion, the Lorentz-Dirac equation, is highly pathological. In 
the Born-Infeld theory, the properties of the field of a static charge look promising, 
but for an accelerated charge very little can be calculated and the properties of the 
field are largely unknown. For the Bopp-Podolsky theory, the field of an accelerated 
point charge can be calculated, in terms of an integral over the history of the particle 
which is manageable to a certain extent, and it can be shown for a large class of 
accelerated worldlines that the field is, actually, finite. No negative infinite bare mass 
needs to be postulated, and the equation of motion can be assumed to be the usual 
Lorentz-force equation with the (finite) self-field included after averaging over direc- 
tions. The explicit expression of the electromagnetic field, given by the analogue of 
the Lienard-Wiechert potential, is more complicated than in the standard vacuum 
Maxwell theory on Minkowski spacetime, because of the tail terms. However, such 
tail terms are familiar from the standard vacuum Maxwell theory on a curved space- 
time and should not be viewed as a reason for discarding the theory. Although there 
are still several open issues — most notably the absence or non-absence of run-away 
solutions has to be clarified — it seems fair to say that in the Bopp-Podolsky theory 
the infinities associated with point charges are cured to a large extent. We may there- 
fore view it as the best candidate for a conceptually satisfactory theory of classical 
charged point particles. (This does not necessarily mean that the Bopp-Podolsky 
theory is “the correct theory of electromagnetism” at a fundamental, quantum field 
theoretical, level). 

Do these observations teach a lesson with respect to the gravitational self-force? 
In the approximation where the self-gravitating particle is viewed as a perturbation 
of a fixed background spacetime, the theory is very similar to the electromagnetic 
case in the standard Maxwell vacuum theory. Modifying the theory along the lines 
of the Born-Infeld theory seems to be of no use: Firstly, it is largely unclear if the 
Bom-Infeld theory really cures the infinities in the field of an accelerated point 
charge. Secondly, the original Einstein theory was already a non-linear theory whose 
non-linearities had been killed by setting up the approximation formalism for the 
self-gravitating point mass. Therefore, it seems rather meaningless to re-introduce 
non-linear terms. The situation is quite different for the Bopp-Podolsky theory. Here 
linearity is kept but higher-order terms are added. It seems not unreasonable to assume 
that Einstein’s theory can be modified by adding higher-order terms in such a way 
that they survive the approximation, giving rise to a regularising term of the same 
kind as it occurs in the Bopp-Podolsky theory. Higher-order theories of gravity have 
been investigated intensively. They are mainly motivated by the observation that 
quantum corrections to Einstein’s theory are expected to give a Lagrangian that is 
of quadratic or higher order in the curvature, resulting in field equations that involve 
fourth derivatives of the metric. (The simplest class of such theories is the class 
of f(R) theories which are reviewed, e.g., in the Living Review by de Felice and 
Tsujikawa [22]). Looking for a version that gives rise to a Bopp-Podolsky-like term 
seems to be a promising programme that might give a new theoretical framework for 
getting a finite gravitational self-force. 
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Self-gravitating Elastic Bodies 



Lars Andersson 



Abstract Extended objects in GR are often modelled using distributional solutions 
of the Einstein equations with point-like sources, or as the limit of infinitesimally 
small “test” objects. In this note, I will consider models of finite self-gravitating 
extended objects, which make it possible to give a rigorous treatment of the initial 
value problem for (finite) extended objects. 



1 Introduction 

Extended objects in GR are often modelled using distributional solutions of the Ein- 
stein equations with point-like sources, or as the limit of infinitesimally small “test” 
objects. In this context, gravitational self-force manifests itself through corrections 
to geodesic motion, in analogy to radiation reaction. This is relevant for example in 
the analysis of extreme mass ration inspirals, see [1]. See also the papers by Harte 
[2] and Pound [3] for background on the self-force problem. 

A widely studied model for objects with internal structure in general relativity 
are so-called spinning particles. There are several formal approaches to deriving the 
corrections to geodesic motion for such object, see [4] for a survey. These works 
rely to a large extent on the study of distributional stress-energy tensors representing 
the particle-like objects. On the other hand, limiting procedures have been applied 
to study objects with internal structure by Wald and collaborators, cf. [5]. In this 
note, I will consider models of finite self-gravitating extended objects, which make it 
possible to give a rigorous treatment of the initial value problem for (finite) extended 
objects. Such models could serve as a basis for the above mentioned limiting con- 
siderations. 

A serious difficulty in treating self-gravitating material bodies in general relativity, 
is that matter distributions with finite extent are typically irregular at the surface of the 
body. This phenomenon can be seen already by considering a stationary Newtonian 
polytrope, with equation of state 
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p = Kp^ 



Then the density p behaves as 



p{x) ~ Jt'-i (x, dO) 

where J(x, dO) is the distance to the boundary of the body. Recall that the sound 
speed Cs for such a polytrope is given by 

and hence Cs tends to zero at 3^2. It follows that the hyperbolicity of the Euler equa- 
tions degenerates at the free boundary, characterized by the vanishing of pressure, 
of a typical polytrope in vacuum. In particular the particles at the boundary move as 
if in free fall. 

Perfect fluid bodies in vacuum with equation of state such that the density at 
the free boundary is non- vanishing are sometimes referred to as liquid bodies. An 
example of an equation of state of this type is 

P = Dip - pa) 

where D, pQ are suitable constants. For a steady fluid body with this equation of 
state, the density will be po at the boundary of the body. In this particular case, we 
also see that the sound speed does not go to zero at the boundary, and there is no 
degeneration of hyperbolicity. However, for liquid bodies this is not generally the 
case. See [6, Sect. 3.5] for discussion. 

For elastic bodies, like liquid bodies, we may expect the density of the material 
to be non-zero at the boundary, and hence there will be a jump in the density at the 
surface of the body. Further, for elastic bodies, we may expect that the field equations 
remain non-degenerate and hyperbolic up to boundary. For elastic bodies, the free 
boundary condition, which can be formulated as saying that the normal pressure at 
the boundary vanishes, is known as the zero traction boundary condition. 

Following the qualitative discussion above, we shall now mention some results on 
the Cauchy problem in continuum mechanics. First we consider infinitely extended 
bodies. For the case of fluids, Christodoulou [7] gives a conditions for shock forma- 
tion for small data, while for elastic materials John [8] gives a condition (genuine 
nonlinearity) under which small data lead to formation of singularities. Sideris [9] 
gives a version of the null condition for elasticity and proves global existence for 
small data. 

For bounded matter distributions, the situation is more complex. As mentioned 
above, for liquid or fluid bodies in vacuum, the hyperbolicity of the evolution equation 
degenerates at boundary. This problem can be overcome by using e.g. weighted 
energy estimates. See [10-12] for recent work on this problem. The Cauchy problem 
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for elastic bodies with free boundary can in Lagrange coordinates be written as 
a quasi-linear hyperbolic problem with boundary condition of Neumann type and 
treated using the methods of e.g. [13]. See [14, 15] for applications of these techniques 
in elasticity. 

If we on the other hand consider self-gravitating material bodies, much less is 
known. In fact, apart from some limited results which we shall mention below, the 
problem of constructing solutions of the initial value problem for self-gravitating 
liquid or fluid bodies in vacuum (both in Newtonian gravity and GR) is largely open. 

The Einstein equations imply hyperbolic equations for the components of curva- 
ture. Hence the irregularity at the boundary of a self-gravitating body could in general 
be expected to radiate into the the surrounding spacetime, preventing this from being 
regular, cf. Fig. 1. As this clearly does not occur for realistic self-gravitating bodies, 
there must be a geometric “conspiracy” at the boundary of a self-gravitating body 
undergoing a regular evolution in Einstein gravity. This then has to be reflected in 
compatibility conditions on the Cauchy data for such a body, see [16]. 

It has in recent work been possible to prove local well-posedness for the the initial 
value problem for self-gravitating elastic bodies in Newtonian gravity, cf. [15], and 
general relativity, cf. [17, 18], see also Sect. 4 below. In both cases, one finds that 
corner conditions on the initial data originating from the free boundary condition, 
which from a PDF point of view is of Neumann type, as well as compatibility 
conditions on the Cauchy data. 

If we turn to dynamical liquid or fluid bodies in general relativity, the results 
are quite limited. Choquet-Bruhat and Friedrich [19] considered the initial value 
problem for a dust body in Einstein gravity, assuming a density which is regular at 
the boundary. The work of Kind and Ehlers [20] on self-gravitating fluid bodies in 
general relativity restricts to spherical symmetry but allows a discontinuity at the 
boundary for the matter density. Rendall [21] was able to prove local well-posedness 
for Einstein-fluid bodies with certain restricted class of equations of state, and with 
smooth density at the boundary. 

Steady states of self-gravitating bodies provide in particular solutions of the initial 
value problem, and thus, apart from their intrinsic interest, a study of steady states 
gives useful information for the study of the dynamics of self-gravitating bodies. 



Fig. 1 It is a priori possible 
that the irregularity at the 
boundary of a body causes 
the surrounding spacetime to 
be irregular 
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Steady states of fluid configurations in Newtonian gravity may be complicated, 
examples are Dedekind and Jacobi ellipsoids, cf. [22, 23]. Lichtenstein [24] con- 
structed static and rotating fluid Newtonian fluid bodies. His results have been 
extended to elastic matter by Beig and Schmidt [25]. For general Newtonian liq- 
uid or fluid bodies there are only limited results available. Lindblad and Nordgren 
proved a priori estimates for incompressible Newtonian fluid bodies [26]. Further, 
problems of dynamics and stability of self-gravitating fluid and liquid bodies in 
Newtonian gravity have been studied by Solonnikov, see e.g. [27, 28] and references 
therein. 

Static self-gravitating fluid bodies are spherically symmetric, in Newtonian grav- 
ity as well as in general relativity, cf. [29]. Lindblom [30] gave an argument showing 
that viscous stationary fluids in GR are axi- symmetric. Heilig [31] constructed rotat- 
ing fluid bodies in GR. It is an open problem whether helically symmetric rotating 
states exist in GR, cf. [32-34] for related work. 

Although relativistic elasticity has been studied since shortly after the introduction 
of relativity, cf. [35] (special relativity), [36-39], until recently no existence or well- 
posedness results except in the spherically symmetric case, cf. [40]. Work by the 
author with Beig and Schmidt shows that there are examples of static self-gravitating 
elastic bodies in general relativity which have no symmetries, cf. [41]. Similarly, there 
are rigidly rotating self-gravitating elastic bodies in general relativity with minimal 
symmetry, i.e. which are stationary and axially symmetric [42]. 



2 Classical Elasticity 

An elastic body is described in terms of configurations with respect to a reference 
body B, a domain in the extended body 
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The configuration maps f from the physical spacetime to the reference body, and 
the deformation maps from the reference body to spacetime are assumed to satisfy 



/o0|g = id. 



The role of the configuration map / in the Eulerian variational formulation of elas- 
ticity in the context of general relativity has been stressed by Kijowski and Magli 
[38]. See the books by Marsden and Hughes [43] and Truesdell and Noll [44] for 
background on elasticity. 

The physical body moves in spacetime M with coordinates = (t,x^) 

and metric Coordinates as well as coordinate indices on B are denoted with 
capital letters, X^. It is convenient to endow the body B with a body metric Bab • For 
many situations, this can be taken to be the Euclidean metric Bab = ^ab • 

We start by considering the non-relativistic case. In the non-relativistic case it is 
natural to take x where is the space-manifold, metric gij , which in 

the non-relativistic case can be taken to be Euclidean. The action for a hyperelastic 
body in Newtonian gravity takes the form 

S= Adtd^x (2.1) 



where 



^ _ j^kin _ ^j^grav j^pot j^elast^ 



( 2 . 2 ) 



where 



= \pv\f-HBy 

^ \yyl 

SttG ’ 

= pUXf-^iBp 
=neXf-HB)- 

See [6, Sect. 3]. Here n = det9/ is the number density, and e = e(/, df) is the 
stored energy function, representing the internal energy of the material. We have, 
for clarity included the indicator function = XB ^ f of the physical body, 

where x(X) = 1 for X e B, and x(X) = 0 otherwise. The physical mass density 
is p = nm where m is the specific mass of the material particles. Eurther, U is the 
Newtonian potential and 

|Vt/|^ = diUdjUg‘J. 



The kinetic term in the action is defined in terms of the square velocity 
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= v‘yjgij, 

with the 3- velocity, given by representing the motion in space 

of the material particles. It should be stressed that the terms j^kin ^ j^elast 
supported on while the term should be viewed having support on the 

whole 

Remark 2.1 

1. Defining the Newtonian potential by the Poisson integral 



U{x) 



= -G f 



f~HB) 1 ^ - ^1 



(2.3) 



the term + A^^^ can be replaced by 

1 

:^pUXf-HB) 

2 The Lagrangian given in (2.2) is of the familiar form 



L = T -V 



with T, V the kinetic and potential terms, respectively. The corresponding Hamil- 
tonian (or energy) is then 

H = T + V 



The elastic stress tensor is 



Tj 



n—f^ ■ 

QfA/ 



This is the canonical energy-momentum tensor for the elastic part of the action. 
Assuming suitable asymptotic behavior for the fields, the Euler-Lagrange equation 
for the action (2.1) is 



pv^d^Vi X f-i (B) + dj in Af-HB)+ p9i U X /-i (B) = 0 (2-4) 



Now, an important fact is that the divergence 



djiTiAf-ns)) 

is a function in only if the normal stress vanishes at the boundary of the body, i.e. 
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cf. [41, Lemma 2.2]. This is due to the fact that the gradient of the indicator function 
is of the form 

diX^ 

where is the surface delta function. Thus, 



pv'^d^Vi -\- djTi^ -\- pdiU =Q, in/ *(S), (2.5a) 

=0 ondf-\B) (2.5b) 

coupled to the Poisson equation 



AC/ = 4TTGpXf-i(B) 



(2.5c) 



which has solution given by (2.3). Here 

= d, + v’di 

SO that 

gives the acceleration of the physical particles. Equation (2.5a) corresponds to New- 
ton’s force law F = ma, where now the force includes both force generated by 
elastic stress as well as the gravitational force, together with the free boundary, or 
zero traction, boundary condition (2.5b). The boundary condition represents the fact 
that the motion of the boundary is not subject to any external forces. 

We recall some facts from potential theory. We can write the Newtonian (volume) 
potential given by (2.3) as 



U = A \4nGpXf-i(B)) 



Differentiating gives 

d^iU = A~^[d^i4TrGp] - S[irgf-i^s^4TrGpiy‘], ( 2 . 6 ) 

where S is the layer potential and i'‘ is the normal to df~^{B). Similarly, d^iS can 
be expressed in terms of the double layer potential V. Standard estimates for 5, D 
and an inductive argument can be used to estimate U . Due to the jump in the matter 
density pXy-i(S) we have that d^U is discontinuous at However, U has 

full regularity up to See [6, Appendix A] for details. 

In the material frame (Lagrange coordinates) the physical body is represented by 
the deformation map 0(S). The material form of the action is got by simply pulling 
back the Lagrange density from the Eulerian picture (in spacetime) to get 
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^material — J ^ i^dtd x). 

The Euler-Lagrange equation can then be calculated purely in the material picture. 
An important simplification is gained due to the fact that the domain of the body in 
the material picture is the reference body B, which is time-independent. One finds 
that under suitable assumptions on the stored energy function, the Cauchy problem 
for the elastic body in material frame is an initial-boundary value problem on B with 
Neumann type boundary conditions. 

Since we have Ti^ = Ti^ the expression djTi-^ is a quasi-linear second 

order operator on /. Disregarding the gravitational self-interaction for the moment, 
hyperbolicity of the system (2.5) is determined by the properties of the elasticity 
tensor 



La 



d^e 



e.g. rank-one positivity 
or pointwise stability 

La j > jbABQ^^ 

where C is some positive constant. If one of these conditions hold, the system (2.5) 
forms a quasi-linear elliptic-hyperbolic system with Neumann-type boundary con- 
ditions. 

A formulation of elasticity compatible with general relativity requires the elastic 
action to be generally covariant. This implies that the stored energy function frame 
indifferent. Define the strain tensor 

and let A/, / = 1, 2, 3 be the fundamental invariants of 7 "^^ = The 

material is frame indifferent if e = e(/, 7^^) and isotropic if e = e(A/). 

Remark 2.2 

1. In the variational problem of classical elasticity (with energy determined purely 
by the elastic term), poly convexity [45], i.e. the condition 

e{F) = e(F, CofF, det F) 

where F = (0^^), with e convex, leads to cancellations which in certain circum- 
stances allow one to show convergence of minimizing sequences. 
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2. Small perturbations around a stress free state are governed by the quasi-linear 
wave equation 



d^4> - cjA(p - (cj - c^)graddiv0 = F(V0, 



cf. [46]. 

3 . The field equation of classical elasticity is analogous to membrane equation which 
has action 



S = 



/ 



y/\(p*g\ 



For the vacuum Einstein equation in wave coordinates, bounded curva- 
ture (which corresponds to regular data) implies local well-posedness [47] . 
For elasticity and the membrane equation, the analogous result would be well- 
posedness for regular initial data. 

A static body is in equilibrium, in particular, the elastic load must balance the load 
from e.g. the gravitational force. Further, in Newtonian gravity. Newtons principle 
actio est reactio implies further that each component of a body must be in equilibrium. 
The following equilibration condition is a consequence of the assumption that the 
total load on a body from elastic stress and gravitational force does not generate a 
motion. Gauss’ law and the zero traction boundary condition gives for any Euclidean 
Killing field f with 



L 



f~HB) 



CdiTj' = 



Idf-HB) 



CTj'tli = 0 , 



The body is static if the stress load balances the gravitational load 



9/r/ = bi := pdjU 



In particular such a load must be equilibrated 

f ebi = 0 , 

for any Killing field For a general load this is a non-trivial condition, but a 
gravitational load is automatically equilibrated. 

As mentioned above, it is convenient in applying PDF techniques to elastic bodies, 
to consider the system in the material frame. This is true both in the construction of 
steady states of Newtonian elasticity, see [25] and references therein, but also for the 
Cauchy problem. Assuming suitable constitutive relations, the initial value problem 
for a Newtonian self-gravitating body in material frame is an elliptic-hyperbolic 
system with Neumann type boundary conditions. Well-posedness has been proved 
for this system in [15], assuming suitable constitutive relations. This result gives the 
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first construction of self-gravitating dynamical extended bodies with no symmetries. 
One finds that the initial data must satisfy compatibility conditions induced by the 
Neumann boundary conditions. 



3 Elastic Bodies in General Relativity 



The action for an general relativistic elastic body is 

s = -j . (3.1) 

where A = A(/, 5/, g) = neXf-^^S) the energy density of the material in its 
own rest frame. Here we have included the indicator function x space-time 

trajectory of the body explicitely in the action. The relativistic number density is 
given by ^ = det( 7 ^^)t/^ with 7 ^^ = , and e is the stored energy 

function. As mentioned above, general covariance demands frame invariance, i.e. 
e = e(/, g)- 

The Euler-Lagrange equations for this action are the Einstein equations 



= "^TTGTfyjyXf-^{B) ’ 



(3.2a) 



where 



= Ruv - ^Rguv, = 2 ^^ 



-Rg, 



fiu 



The elasticity equations, including the free boundary condition 



where is the normal to the (typically time-like) boundary of the spacetime domain 
of the body are consequences of the conservation equation 



V^(VX7-i(^)=0 (3.2b) 

which in turn follows from the Einstein equation (3.2a), but which can also be 
derived as the Euler-Lagrange equation for the action with respect to variations of 
the configuration map. The field equations for a general relativistic elastic body may 
thus be viewed as the Einstein equation (3.2a) or, equivalently, as the coupled system 
(3.2). 
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3.1 Static Body in GR 



We next consider the case of static self-gravitating bodies in general relativity. Thus, 
we assume (M, is static, i.e. there is a global timelike, hypersurface orthogonal. 
Killing field Then we have that Ad = M x M and we may introduce coordinates 
= (t, x^) such that the Killing field ix = dt, with norm For 

a static spacetime we can write 

gapdx^dx^ = —e^^dt^ -h hijdx^ dx^ 

where U, hij depend only on x\ Kaluza-Klein reduction applied to (3.1) gives the 
action 



S = - ! -^^{Rh-2\VU\l)+ [ e^neVh 

JM lOTTCr 

The Euler-Lagrange equations are 



(3.3) 



= e^ine - a,‘)ViU in/-‘(B), (71^ njlf-^gs) = 0 

AhU = 4nGe^ (ne — a/)Xf-i(B) 

Gij = SwGi&ij - e^cTij Xf-i(B)) 

where 



©// = 



47tG 



WiUWjU--hij\WU\^ 



This system is equivalent to the 3 -h 1 dimensional Einstein equations for the static 
elastic body. 

Let a relaxed reference body B be given. Eor small G, we construct a static self- 
gravitating body, i.e. a solution to the static Einstein-elastic equations, which is a 
deformation of S, cf. [41]. The construction is carried out in the material frame. 
Working in harmonic coordinate gauge, the reduced Einstein-elastic system can be 
cast in the form 

A*(G, Z) = 0, 



where G is Newtons constant and Z denotes the fields in the material frame version 
of the system, i.e. the deformation map 0 as well as the material version of the 
Newtonian potential U and the 3-metric htj . Assuming suitable constitutive relations, 
the reduced system of Einstein-elastic equations is an elliptic boundary value problem 
with Neumann type boundary condition. Given a relaxed background configuration 
Zo, which can be viewed as a solution of the Einstein-elastic system with Newtons 
constant G = 0, we would like to apply the implicit function theorem to construct 
solutions to (3.2) for small G. 
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However, an obstacle to doing so is the fact that the linearized operator Z)z*F(0, Zq) 
necessarily fails to be an isomorphism. In fact, due to invariance properties of the the 
equilibration condition, the infinitesimal Euclidean motions, i.e. the Killing vector 
fields on Euclidean 3-space, are in the kernel. Eurther, due to the linearized operator 
Z)z^(0, Zq) has a non-trivial co-kemel, which also corresponds to the infinitesimal 
Euclidean motions. This is due to the fact that the the linearized elasticity operator 
at the reference configuration is automatically equilibrated. Thus, we have a kernel 
and cokernel corresponding to the Killing fields of the Euclidean reference metric on 
M and on Applying a projection to DzJ^iO, Zq) in order to get an isomorphism 
we are in a position to apply the implicit function theorem to construct a solution for 
small G to the projected system 



The proof is completed by showing that the solution to the projected system is auto- 
matically equilibrated, i.e. it is a solution to the full system, including the harmonic 
coordinate condition. 

By choosing the reference body to be non- symmetric, we thus get the first con- 
struction of self-gravitating static elastic bodies in general relativity with no symme- 
tries. Outside the body, the spacetime is a solution of the vacuum Einstein equations, 
which will be asymptotically fiat, but with no Killing vector fields except for the 
static Killing field. 

In Newtonian gravity there are many examples of static self-gravitating many- 
body systems, consisting of rigid bodies of the type shown schematically in Eig. 2. 
The method described above in the case of static self-gravitating bodies extends to 
A-body configurations [48]. In this case, one takes a Newtonian static configuration 
A -body configuration consisting of rigid, self-gravitating bodies as the starting point. 
Under some conditions on the Newtonian potential one can apply a deformation 



Z) = 0. 




Fig. 2 Examples of two-body configurations in equilibrium 
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Fig. 3 Bodies separated by 
a plane cannot be in 
equilibrium in Newtonian 
gravity. This holds in GR if 
the plane is replaced by a 
totally geodesic 
hypersurface. It would be 
interesting to find a more 
general characterization of 
what static configurations are 
possible 





technique related to that used in the construction of static self-gravitating bodies to 
construct A/^-body configurations. A particular case consist of placing a small body 
at a stationary point of the gravitational potential of a large body. 

The proof makes use of the additional degree of freedom corresponding to the 
difference in the centers of mass and alignments of the bodies to achieve equilibration. 
In Newtonian gravity, one proves easily that a two bodies separated by a plane cannot 
be in static equilibrium, cf. Fig. 3. This relates to Newton’s principle actio est reactio, 
also mentioned above, which implies that each body must be equilibrated with respect 
to its own self-gravity. 

In general relativity, we lack the concept of force (see however [49] for related 
ideas in the static case) and the problem of characterizing “allowed” /i-body configu- 
rations is open. Partial results on this problem have been proved by Beig and Schoen 
[50], and Beig et al. [51]. In particular, bodies separated by a totally geodesic surface 
cannot be in static equilibrium. 

In order to describe rotating, self-gravitating bodies, we must consider stationary 
spacetimes, i.e. spacetimes with a Killing field which in the relevant situation will 
be timelike, but not hypersurface orthogonal. In this case, Kaluza-Klein reduction 
gives action 



S = -f j^(i^h-2\DU\l+e^^\u\l)+ f nee^Vh, 

Jm IottG V / Jm 

In this case, one may use techniques related to those discussed above to construct 
self-gravitating rotating bodies in general relativity as deformations of axi-symmetric 
relaxed, non-rotating, reference states, see [42]. By choosing the reference body 
appropriately we get rigidly rotating self-gravitating elastic bodies with a minimal 
amount of symmetry, i.e. with no additional Killing vector fields than the stationary 
and axial Killing vector fields. The asymptotically fiat vacuum region surrounding the 
rotating body can in that case be shown to have exactly these two Killing symmetries. 
It is plausible that all stationary, asymptotically fiat spacetimes which are vacuum 
near infinity, are axisymmetric. 
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4 Dynamics of Elastic Bodies in General Relativity 



We write the Einstein-elastic system, cf. (3.2), in the form 



= STrG(T^„ - -Tgn„)Xf-i(B) 
V''7),, = 0 in /-‘(B) 



In order to construct solutions to the Einstein equation it is convenient to work in 
wave coordinates gauge. 



A standard calculation, cf. [52, Sect. 10.2] shows that with (4.1) imposed, the Einstein 
equation takes the form becomes a quasi-linear wave equation of the form 



where \3g = is the scalar d’Alembertian and is is an expression which 

is quadratic in derivatives of g^iy. Assuming suitable constitutive relations for the 
elastic material, the Einstein-elastic system now becomes a quasi-linear hyperbolic 
system, and one can proceed to construct solutions along standard lines. 

A serious obstacle however is the fact that the matter density has a jump at the 
surface of the body. This means that using standard techniques it appears difficult 
to prove local well-posedness for this system, even using sophisticated harmonic 
analysis techniques, as appears in the proof of the curvature conjecture. In a joint 

paper with Oliynyk [17] we have given a proof of local existence for solutions of 
quasi-linear systems with the appropriate discontinuity in the source term. There 
we have also given an outline of the application of the results of that paper to the 
Einstein-elastic system [17, Sect. 5]. Details will appear in a joint paper with Oliynyk 
and Schmidt [18]. 

An important aspect of the problem can be seen by considering the following 
model problem. In with coordinates (x^) = (r, v^), let □ = —df + A and 
consider the Cauchy problem 




(4.1) 




\3u = F(t, X, u, du)xQ, 
u(0) = u^, dtu(0) = u^. 



(4.2a) 

(4.2b) 



Let ui = dfu, Fi = dfF and let 5' be a given, sufficiently large integer, and let 
the spaces W be defined by 
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s = 0,l, 

n H^(Q) n \Q), s > 2. 



Suppose we are given data satisfying the compatibility conditions 



uiiO) e (4.3) 

and assume that Ff(-, t) e //■'“^(f2),for0 < £ < s. Time differentiating the equation 
yields 

i = ;■ = 0, . . . , s. (4.4) 

A standard energy estimate shows that Us+\ are bounded in and L^, respec- 
lively. One gets improved regularity for lower time derivatives by an induction argu- 
ment. From (4.4) for 7 = 5 ’ — 1, we have 

/^Us-i = Fs-ix^ + d^Us-i 
= Fs-iX^ + Us+\. 

The potential theory results mentioned in Sect. 2 imply that Us~\ G H}. Suppose 
now we have for £ > 1 an estimate for Us-i in in terms of the initial data and 
the bound on Fs-i in Then we have from Eq. (4.4) fox j = s — \ — I, 

/Xus-i-i = Fs-i-iX^ + Us+i-i e 

and the potential theory results we can now be used together with the assumptions on 
the initial data and F, to give an estimate for Us-i-i in Induction with i = I 

as base yields an estimate for . 

An argument similar to the above forms an important part in the proofs of local 
well-posedness in the papers [17, 18] mentioned above. The compatibiliary condi- 
tions (4.3) on initial data can be interpreted as implying that the body (or in the model 
problem, the source) existed and was regular in the past of the initial Cauchy surface, 
i.e. one must have the situation illustrated in Fig. 4. 



Fig. 4 The solution to the 
Cauchy problem is regular 
provided the Cauchy data is 
compatible with the source 
having existed in the past 



{t = 0} 



( 
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On Geodesic Dynamics in Deformed 
Black-Hole Fields 



O. Semerak and P. Sukova 



Abstract “Almost all” seems to be known about isolated stationary black holes in 
asymptotically flat space-times and about the behaviour of test matter and flelds in 
their backgrounds. The black holes likely present in galactic nuclei and in some 
X-ray binaries are commonly being represented by the Kerr metric, but actually 
they are not isolated (they are detected only thanks to a strong interaction with 
the surroundings), they are not stationary (black-hole sources are rather strongly 
variable) and they also probably do not live in an asymptotically flat universe. Such 
“perturbations” may query the classical black-hole theorems (how robust are the 
latter against them?) and certainly affect particles and flelds around, which can have 
observational consequences. In the present contribution we examine how the geodesic 
structure of the static and axially symmetric black-hole space-time responds to the 
presence of an additional matter in the form of a thin disc or ring. We use several 
different methods to show that geodesic motion may become chaotic, to reveal the 
strength and type of this irregularity and its dependence on parameters. The relevance 
of such an analysis for galactic nuclei is briefly commented on. 



1 Introduction 

Geodesic structure is a very comprehensive and demonstrative attribute of space 
(-time). As traversing regions of all possible sizes, geodesics can unveil a local behav- 
iour of a given system as well as tiny tendencies only discernible over an extensive 
span of time. A default example of the latter are weak irregularities attending a lack of 
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a complete set of integrals of the motion. Actually, the long-term geodesic dynamics 
is a suitable tool how to detect, illustrate, evaluate, classify and compare different 
deviations of a chosen system from a certain simple, “regular” ideal. In mathematics 
and physic, such an ideal is represented by linear systems (the finite-dimensional in 
particular). However, within the last 150 years it has become clear that even in these 
highly abstract fields the linear systems represent just marginal tips within a vast 
non-linear tangle which is typically prone to “irregularities” and which can display 
“chaotic” behaviour even in rather simple settings. 

The modem theory of chaos was apparently inspired by Henri Poincare’s treat- 
ment of a three-body system, and our interest will also focus on systems driven by 
gravitational interaction in this contribution. Specifically, we consider a simplest 
possible gravitational centre, the Schwarzschild black hole, and study how the time- 
like geodesics in its field respond on perturbation due to the presence of a very 
simple additional source, namely a static and axially symmetric thin annular disc 
or ring. Such a topic has an extra attraction in general relativity, due to the latter’s 
non-linearity. Besides that, as reminded in [1], it is doubly apropos to seek for chaos 
around black holes, because in ancient Greek CHAOS meant a gaping bottomless void, 
where everything falls endlessly . . .A more sophisticated reason was articulated nicely 
by [2]: “. . .even the most pristine black-hole space-time harbours the seeds of chaos 
in the form of isolated unstable orbits. A small perturbation causes these unstable 
orbits to break out and infect large regions of phase space. Note that the experience 
with Newtonian systems is very misleading. For example, the Kepler problem has 
more integrals of motion than are needed for integrability. Keplerian systems are 
thus impervious to small perturbations. In contrast, black hole space-times are at the 
edge of chaos, just waiting for the proverbial butterfly to flap its wings.” 

However, the space-time of a black hole enclosed by a disc or ring has not been 
chosen only for theoretical reasons: the black holes probably present in galactic nuclei 
and in some X-ray binaries are supposed to be interacting with their surroundings 
through disc accretion, and, besides that, galactic nuclei are also typically encircled, 
at larger distance, by tori of colder molecular gas and dust (called circum-nuclear 
rings). These structures (at least the inner accretion disc) are assumed to be rather 
lightweight with respect to the central black hole, and theoretical models actually treat 
them as test (not contributing to the gravitational field at all). Such an approximation 
is certainly adequate concerning the potential (metric) and its gradient (field), but 
may not hold for higher derivatives (curvature). However, the curvature terms are 
important for stability of the motion, so one can expect that a gravitating matter would 
assume a different configuration than a test one. This could be important in the physics 
of accretion discs, as the latter e.g. depends crucially on position of the disc inner 
radius (approximated by the innermost stable circular orbit); this in turn plays a key 
role in estimating the black-hole spin from observations. The issue was pointed out 
already by [3] on Newtonian grounds and later confirmed in analysis of the sequences 
of exact space-times describing the fields of a black hole with concentric thin discs 
(e.g. [4, 5] and references therein). It turned out, in particular, that the properties of 
circular motion in the disc plane are indeed altered due to the orbiting-matter own 
gravity, for example, the frequencies of epicyclic oscillations indicate that the disc 



On Geodesic Dynamics in Deformed Black-Hole Fields 



563 



may remain stable closer to the horizon, whereas it rather inclines to instability at 
larger (intermediate) radii (e.g. [6]). 

It should be stressed right away that a test body orbiting in a black-hole field 
has various other reasons why to behave in a chaotic way. Even if leaving aside the 
most probable perturbation, namely due to a mechanical interaction with the ambient 
medium, and focusing only on gravitational effects, the dynamics depends on how 
the body is described (whether it is point-like, i.e. without structure, or endowed 
with more multipoles than just mass), on whether/how it is influenced by incident 
gravitational and electromagnetic waves, and on whether/how the gravitational and 
electromagnetic emissions of the body and corresponding reactions on itself are taken 
into account. Besides that, in reality the body also feels back reaction due to its own 
gravity (it is not strictly test). Finally, one has to take care of what kind and amount of 
chaos is added by the numerical code alone: numerical truncations are easily under 
control for regular orbits along which a sufficient number of quantities (a “complete 
set of integrals”) remains constant, but for chaotic trajectories such a full check is not 
at hand. It is really confirmed by experience that even with just slightly modified code 
it is almost impossible to reproduce accurately a given long-term chaotic evolution. 
This urges caution in interpreting the results of chaotic-dynamics modelling and 
suggests not to trust it uncritically down to quantitative details. 

In such an area, it is of particular importance to use several independent methods 
(and, if possible, also different basic codes for integration of the equations of motion) 
and compare their outcomes. The rapid progress that the field of chaotic dynamics 
have been experiencing since the middle of the 20th century has indeed yielded a 
number of techniques which, starting from the beginning of 1990s, have been also 
applied to general relativistic systems. One can roughly divide them into two groups: 
those which require the knowledge of the system’s dynamics (evolution equations) 
and those which can manage with just series of values (time series of experimental 
data, for example). Of the first group, we employed the Poincare surfaces of section, 
the Lyapunov exponents and two other similar indicators (abbreviated as ELI and 
MEGNO) and also followed the evolution of the so called latitudinal action (given by 
the component of four-momentum which is not bound by any constant of motion). 
Concerning the second group of methods, we drew the power spectrum of a certain 
dynamical variable (“vertical” position of the particle in our case) and subjected the 
motion to two variants of recurrence analysis, one focused on directions in which 
the orbits recurrently traverse “pixels” of a prescribed phase-space grid and the other 
based on recurrences to the chosen cells themselves. 

Below we first describe our system in more detail. Then, in Sect. 3, we briefly out- 
line methods we have used to analyse the regime of geodesic dynamics in the chosen 
black-hole-disc/ring field, and review basic observations they provide. Comments 
concerning astrophysical relevance of the results and possible further extensions of 
the work are given in Concluding remarks. We will not go into details, especially 
not concerning the theory of chaos itself and the diagnostic methods used, focusing 
mainly on results and their interpretation. For a thorough account, please, see the 
papers [1, 7, 8] and references therein. Let us add that we use geometrized units in 
which c = \ and G = 1 and the metric (g^^z^) signature ( — h + +). 
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2 Static and Axially Symmetric Metrics for a Black Hole 
Surrounded by Discs or Rings 

Due to the non-linearity of Einstein equations, the fields of multiple sources are very 
difficult to find in general. In some situations, however, the equations simplify to a 
form which permits to solve at least a certain part of the problem in a Newtonian 
manner (namely, certain metric components superpose linearly). One very impor- 
tant example of such a setting is a static and axially symmetric case. Choosing the 
parameters of these symmetries as the time and azimuthal coordinates, t and 0, the 
vacuum metric can always be written then in the Weyl form 



involving just two unknown functions v and A depending only on coordinates p 
and z (cylindrical-type radius and “vertical” axis) which cover the meridional sur- 
faces. The y function represents Newtonian gravitational potential and is given by 
Laplace equation, hence it superposes linearly. The other function A represents non- 
Newtonian part of the problem. It is given by line integration of the derivatives of z/, 
reducing to zero along the vacuum parts of the axis; it does not superpose linearly 
and except some special cases it has to be found numerically. 

In general relativity, the motion has 3 degrees of freedom in general, because four- 
velocity is always constrained by normalization g^^^u^u^ = —1. In the above 
space-times, we have two independent constants of geodesic motion thanks to the 
stationarity and axial symmetry, namely energy and angular momentum with respect 
to infinity per unit particle mass, £ = —gttu^ and I = g(f)(f)U^. In contrast to the space- 
times of isolated stationary black holes, there is no irreducible Killing tensor and 
consequently no other independent conserved quantity (the so-called Carter constant 
quadratic in four- velocity), which implies that the geodesic motion may become 
chaotic. 

We have been interested in time-like geodesic dynamics in the field of a Schwarz- 
schild black hole surrounded, in a concentric way, by an annular thin disc or ring. 
Specifically, we considered one of the discs (mainly the first one) of the counter- 
rotating Morgan-Morgan family, inverted (Kelvin-transformed) with respect to their 
rim(seee.g. [4, 9]),^ and also the Bach- Weyl solution for a circular ring (e.g. [10, 11]). 
The Newtonian surface densities of the inverted Morgan-Morgan (iMM) discs are 



^ We also checked that the results are similar for discs of the family with power-law density profile 
[5] which are however more demanding computationally. 



ds^ = + p^e~^^d(t? + + dz^) 



( 1 ) 




( 2 ) 
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and their fields are described by the potentials 
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where the coefficients read 



^(m) _ (-1)"(4» + l)(2»)!(m + n)\ 
“ (n!)2(m - n)!(2m + 2n + 1)! 



< m), 



Plniy) are Legendre polynomials and Q2n(^^) are Legendre functions of the second 
kind, M and h are the disc mass and Weyl inner radius and (x,y) are oblate spheroidal 
coordinates related to the Weyl coordinates by 



= b^(x^ + 1)(1 - y^), z = bxy. 



On the symmetry axis (p = 0) where x = ^ , y = signz, 
, ^ = 1 , the disc potentials simplify to 
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Our second external source, the Bach- Weyl ring (of mass A4 and Weyl radius b), 
generates potential 



z^w = — 



2MK(k) 

Trh 



h,2 = 'JipT by + z2, 



(4) 



where K {k) = 



'nil 
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is the complete elliptic integral of the 1st kind, with 



modulus and complementary modulus given by 



(h)^ ^ 4b p 
(h)^ (/2)2’ 



k'^ = 1- 
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ih)^' 



Especially on the axis p = 0, one has k = 0, K = tt/ 2, so z^w = • 

Our main goal has been to analyse the behaviour of the geodesic flow in depen- 
dence on parameters of the system, namely on the relative mass and radius of the 
disc/ring and on energy and angular momentum of the particles. It should be admitted 
that both sources are singular (the discs are 2D in space and the ring is even ID) and 
curvature scalars really diverge at the ring as well as at the inner edge of the first 
iMM disc which we consider mostly (the higher-m is the disc, the less singular it 
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is at the edge). Hence, it is desirable to exclude from study the orbits which would 
approach the ring or the inner edge of the disc too closely, because there the space- 
time hardly corresponds to the as trophy sical fields we want to approximate. Anyway, 
we assume that the test particles do not interact with the external source mechanically 
(in particular, they traverse the disc without collision). 



3 Geodesic Chaos in the Black-Hole-Disc/Ring Fields 

Turning first to Poincare surfaces, we were recording transitions of suitably chosen 
large sets of bound orbits through the equatorial plane (the plane of symmetry defined 
by the disc or ring) and drew the passages in terms of a radial component of four- 
velocity against radial coordinate. The results showed typical properties of a weakly 
non-integrable dynamical system, well described by the KAM theory. They were 
also confirmed by power spectra which we computed from vertical-coordinate time 
series, and by evolution of the “latitudinal action” given by integral of the latitudinal 
component of momentum over an orbital period. (The character of dynamics can be 
well estimated from the time series itself, as well as from a spatial plot of the orbit.) 
Without repeating details from [1], let us list our main observations: 

• Geodesic flow tends to irregularity with increasing relative mass of the external 
source. The strongest chaos typically occurs when the disc mass is comparable 
to that of the black hole (for the ring it happens about 1/10 of the black-hole 
mass). For still heavier external source, the system rather returns to a more regular 
behaviour. 

• Similarly with the dependence on particle energy: chaos first develops with £ 
increasing, but for very high values it rather reduces back. 

• The above are just overall tendencies, however the change of the system with 
parameters is by no means smooth. Indeed, one of the most typical and interesting 
aspects of chaotic dynamics is its chaotic dependence on parameters, with abrupt 
changes of the phase portrait occurring/disappearing within very narrow parameter 
ranges. 

• The dependence on angular momentum I is opposite: larger I means larger part 
of energy allotted to azimuthal motion; and this component of motion is “held” 
exactly by the conserved value of f , so its larger value favours regularity. 

• The ring being stronger source than the disc, it also induces stronger perturba- 
tion of the geodesic dynamics. In particular, the ring presence within the region 
accessible to the particles generates so many higher-periodic regular islands (sur- 
rounded by chaotic layers) that the resulting Poincare diagrams can (also) be used 
as sophisticated wallpapers. However, if close encounters of the particles with the 
ring are prevented, the geodesic flow gets only moderately chaotic. 

• On (equatorial) Poincare sections, the geodesic dynamics rather tends to break 
up from the boundaries of the accessible phase-space region (these boundaries 
correspond to a zero vertical/latitudinal component of velocity, thus to a motion 
within the equatorial plane), while a certain regular region often survives in the 
interior. 
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• Chaos is a “global” phenomenon and its notion is being used in connection with 
sufficient time spans, yet it is possible (and practically even inevitable) to speak 
of a degree of (ir)regularity of a certain restricted section of an orbit. Different 
sections of the same orbit may display very different degrees of chaoticity and 
the dynamics may switch between these modes quite suddenly. Orbits with such a 
variable behaviour are typical for weakly perturbed systems and at the same time 
they are most interesting for comparing different methods. 

• Consistently with experience from the literature, such “weakly chaotic” (parts of 
the) orbits which at times range over the chaotic “sea” but also linger for a consid- 
erable intervals very close to regular regions of phase space (they “stick” to them, 
hence sticky orbits) rather produce “1/frequency” spectrum (i.e. approximated by a 
straight line in the log-log plot), whereas the “strongly chaotic” (parts of the) orbits 
typically produce “cat-back” spectral profile with (say, 100-times) less power at 
the low-frequency end, with less distinct peaks and more of tiny irregularities. 

In [7] we subjected the above dynamical system to two recurrence methods. The 
first of them, abbreviated WADV (from “weighted average of directional vectors”), 
was proposed by [12] as a way to distinguish between deterministic and random 
systems. Employing it within general relativity for the first time (as far as we know), 
we found it is even fairly sensitive to different degrees of (deterministic) chaos, 
especially in the regime of very weak perturbations (which we are mainly interested 
in). Briefly speaking, one takes a time series of some dynamical variable (we take 
vertical coordinate z(r) as a function of the particle proper time r) and reconstructs 
the (3D) phase space from z(r), z(r — Ar), z(r — 2Ar), where Ar is some time 
delay; this space is then “rasterized” into a grid of boxes of some chosen size. The 
main point is to add (unit) directions in which orbits recurrently cross the prescribed 
boxes, average the length of the accumulated vector over the cells with respect to the 
number of passages and analyse its dependence on Ar (and on the box size). Finally, 
the result is evaluated against its counter-part obtained for unit- step random walk, 
namely, one computes 



where Vj is the norm of the vector accumulated after nj passages through the jth 
box and 



A = average 




over cells (over j) 



( 5 ) 




denotes the average shift per step for random walk in d dimensions, obtained for 
n steps (with particular value given for J = 3 which is relevant in our case). Now, 
the A realizes a normalized autocorrelation parameter: in theoretical limit (infinite 



568 



O. Semerak and R Sukova 



evolution and infinitely fine grain), A = 1 for a deterministic signal, whereas A = 0 
for random data; in practice, A(Ar) more or less decreases from 1, the faster the 
more chaotic (or even random) is the evolution. 

The second recurrence method we applied rests on recurrences themselves of the 
orbits to chosen neighbourhoods of their past points. Denoting by X(/) = X(t(/)) the 
N successive points of a phase-space trajectory,^ the recurrences are simply recorded 
in the so called recurrence matrix 



/?,j(e) = 0(e- llX(i)-Xo)ll), iJ = l,...,N, (6) 

where e is the radius of a chosen neighbourhood (the selected threshold of “close 
return”), || • || denotes the chosen norm (the picture of long-term dynamics only 
slightly depends on which one is used) and © is the Heaviside step function. The 
matrix thus contains only units and zeros and can be easily visualized by filling black 
dots (units) or blank spaces (zeros) at the respective coordinates (/, 7 ); such figures 
are called recurrence plots and were introduced by [13]. For regular systems, the 
recurrences (black points) arrange in distinct structures, in particular in long parallel 
diagonal lines (their distance scales with period), and weak chaos brings checker- 
board structures, whereas for random systems the recurrences are scattered without 
order; chaotic (deterministic) systems yield the most interesting plots, consisting of 
rectangular blocks of almost-diagonal patterns as well as irregular ones, often looking 
like placed one over another. The pattern of recurrences yields rich and credible data 
which can be further processed in numerous ways (see [14] for a thorough survey). 
Within general relativity, the method has been employed e.g. by [15, 16]. 

Finally, one of the most characteristic symptoms of chaos is a sensitive dependence 
on initial conditions, following from the fast deviation of trajectories in the phase 
space. This divergence can be quantified by various indicators, of which Lyapunov 
characteristic exponents (A/ ) are the most well known, but other useful suggestions 
have also been given. The divergence quantifiers can be computed in two ways, 
either by following two nearby trajectories and the evolution of their phase-space 
separation, or by solving an appropriate variational equation (geodesic-deviation 
equation in our case) along one trajectory. We have adhered to the first, two-particle 
approach, using the procedure proposed — within general relativity — by [17, 18]. 
They argued, in particular, that it is sufficient to compute the orbital divergence in 
configuration space only, without including the momenta. This claim seems plausible 
and we have followed it, but a careful verification is still to be performed. Anyway, 
of the Lyapunov exponents, each characterizing the rate of separation change in a 
certain (/ -th) independent direction, the most important is the maximal one (which 
prevails automatically in a longer evolution). 



^The series of just one variable (e.g. position) suffices actually, since the phase space can be 
reconstructed from a sequence of its time-delayed copies as in the WADV method summarized 
above. 
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Amax = lim - In (7) 

r-^oo T I Ax (0)1 

We use small x for the position in configuration space, X = (x^, p^), so Ax is the 
separation vector whose norm represents the momentary proper distance between 
the two neighbouring orbits. If Amax > 0, than at least one direction exists in which 
the nearby orbits deviate exponentially.^ 

Due to typically very slow convergence of the above limit, a number of related 
quantifiers has been proposed whose computation converges faster and which thus 
reveal the nature of orbits in a significantly shorter integration time. We have tried 
two of them, often used in celestial and galactic mechanics, iho. fast Lyapunov indi- 
cator (FLI) and the mean exponential growth of nearby orbits (MEGNO). The FLI, 
suggested by [19], is given by 



FLI(r) = logio 



|Ax(r)| 

|5(0)| 



( 8 ) 



(restricting only to the configuration- space separation again). FLI(r) grows consid- 
erably faster for chaotic than for regular trajectories and this trend is evident much 
earlier than Amax(^) approaches its limit value. In general relativity (motion in black- 
hole fields), the FLI has been employed by [20, 21]. The second indicator, MEGNO, 
was proposed by [22] as 



Y(T) = 



2 r 1 

T J |Ax(r)| 
0 



d|Ax(r)| 

dr 



r dr. 



( 9 ) 



The content of this quantity is simply its value (rather than rate of growth or even 
“irregularity of behaviour”), which makes it suitable for automatic surveys over large 
areas of phase space. Importantly, the MEGNO distinguishes between regular and 
chaotic evolutions securely, because for regular orbits it tends to 2 (with an additional 
bounded oscillating term), whereas for chaotic orbits it grows linearly, with a slope 
corresponding to the value of the maximal Lyapunov exponent (F ~ AmaxT for large 
enough proper time). A few years ago, [23] found an analytic relation between FLI 
and MEGNO, 



Y(T) = 2 [FLI(T) - FLI(T)] In(lO), (10) 

where FLI(T) is the FLI time averaged over the period (0, T), 



^The exponents should reveal the nature of the flow in the vicinity of the reference world-line, hence, 
while time is running, the separation vector has to be renormalized whenever it reaches a certain 
“too large” value; the velocity deviation vector, given by difference between four-velocities of the 
neighbouring world-lines, has to be renormalized by the same factor. 
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T 

FU(r) = ^J FLI(T)dr. 

0 

Note that one can in turn average the MEGNO over some period in a similar manner 
(cf. the last part of Fig. 8). 

We presented the results obtained using Lyapunov exponents, ELI and MEGNO 
in paper [8], together with more details and remarks e.g. on often queried (non-) 
invariance of the world-line deviation indicators. A thorough comparison of these 
quantities with other similar indicators has recently been given by [24] using the 
variational approach. 



3.1 Relations Between Chaotic Indicators — An Example 



Besides relations like that between the ELI and MEGNO indicators (which are of 
similar nature), yet more interesting are relations between quantities whose origins 
are more independent. One illustration is an estimate of the maximal Lyapunov 
exponent which can be obtained from the MEGNO slope as indicated above, but also 
(for instance) from the 2nd-order Renyi’s entropy (also called correlation entropy), 
one of the indicators derived from the recurrence analysis. Namely, [14] showed that 
this quantity yields a lower estimate of the sum of positive Lyapunov exponents, and 
they also suggested that it can be approximated by 

K2{eJ)^-^\npc{eJ)^ ( 11 ) 

I At 

where Ar is the sampling time step and is the probability of finding a 

diagonal line whose length is at least I in the recurrence matrix (e is the radius of a 
chosen “close neighbourhood”). This means that K 2 is estimated from a slope of the 
cumulative histogram of diagonal lines (of diagonals at least I points long), plotted in 
logarithmic scale against the length /. We computed both estimates (from MEGNO 
slope and from Renyi’s entropy) and confirmed that they are in line with the values 
of Amax obtained by direct computation. 

Let us make a remark concerning the “value/price ratio” of the above indicators. 
Whereas MEGNO was introduced as a handy “condensate” of Lyapunov exponents 
which can be quite easily implemented in automated scans, the Renyi’s entropy is 
more sophisticated, but also less accessible for routine computer evaluation, mainly 
because it has to be determined from a proper part of the histogram. Namely, only 
a certain middle section of the histogram, reasonably following a straight line, is 
relevant, since the short-length end typically diverges due to increasing number of 
“sojourn” points (successive points between which the orbit does not leave the given 
e-neighbourhood"^), while the long-length end typically falls off quickly due to a 



^These points does not represent true recurrences and are usually discarded from the statistics. 
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finite length of the trajectory. One should stress that the histogram is theoretically 
to be computed in the limit I oo, so the short-length end has actually no sense, 
while the long-length end is of course determined by the fact that practically the 
trajectories cannot be infinitely long. We are depicting this to point out the benefits 
of some simpler recurrence quantifiers, especially of the one called DIV, given by 
inverse of the length of the longest diagonal. We compared the K 2 entropy with the 
DIV, in particular, we coloured several Poincare diagrams according to the values of 
K 2 and DIV obtained for the orbits followed in a given run, and observed that both 
the quantifiers provide virtually the same information (cf. Fig. 7). 



4 Numerical Illustrations 

Let US give several examples of how the varying degrees of chaoticity are revealed 
by the methods we have mentioned above. The passages through a suitably chosen 
plane, plotted in a suitably chosen variables (called Poincare maps), provide a direct, 
reliable and indicative picture of a studied motion. We naturally start with them, 
showing how the nature of geodesic flow around a black hole changes when a more 
and more massive thin disc (Fig. 1) or ring (Fig. 2) are placed around in a concentric 
manner. In both cases, the passages are recorded of some bunch of bound time- 
like geodesics through the equatorial plane of the system, and plotted in terms of the 
Schwarzschild-type radius r and of the corresponding component of four- velocity . 
Figure 3 compares a regular geodesic with a chaotic one, using four representations: 
their spatial tracks, equatorial Poincare sections, time dependence of the “vertical” 
(z) position and the corresponding power spectra. Figure 4 compares two different 
parts of the same geodesic, one “sticking” to a regular island and the other filling a 
chaotic sea. Four plots are shown — equatorial Poincare sections, time series of the 

four- velocity component, the power spectra of z(t) and, in the second part of the 
figure, recurrence plots; the difference between the two orbital phases is clearly seen 
in all of them. Four different parts of the same orbit are also shown in Fig. 5 and 
their increasing chaoticity revealed on the evolution and on the power spectra 
of z(t). 

The above representations are illustrative for a human eye, but less suitable for a 
quantitative judgement about the “degree of chaoticity” of the system, mainly if such 
a judgement should be entrusted to a computer. The recurrence methods we tested 
offer several “quantifiers” that are more suitable in this respect and which assign 
to every single orbit a single number or a single dependence on some parameter. 
In the first method, the character of the orbit is recognized from the average of 
directions in which the orbit recurrently crosses the prescribed phase-space cells (the 
Kaplan-Glass WADV method described in previous section). In Fig. 6, this averaged 
autocorrelation parameter is compared for a geodesic lying deep in a primary regular 
island and for a one inhabiting a large chaotic region; also added are the respective 
Poincare diagrams and z(0 evolutions. Figure 7 presents several quantifiers computed 
from the recurrence matrix and requires some commentary. The quantifiers shown 



572 



O. Semerak and R Sukova 




Fig. 1 Passages of geodesics with i = 3.1 5 M and S = 0.955 through the equatorial plane of a 
Schwarzschild black hole (with mass M) surrounded by the inverted 1st Morgan-Morgan disc (with 
inner Schwarzschild radius raise = 20M), drawn for different relative disc masses M./M (indicated 
in the plots). Red line bounds the accessible region. Adopted from [1] 
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Fig. 2 Passages of geodesics with i = 3.1 5 M and S = 0.977 through the equatorial plane of a 
Schwarzschild black hole (with mass M) surrounded by a Bach-Weyl ring (with Schwarzschild 
radius rnng = 20M), drawn for different relative ring masses M/M (indicated in the plots). Red line 
bounds the accessible region. With growing mass of the ring, the phase space apparently becomes 
very complex, containing many islands formed by higher-periodic regular orbits, interwoven with 
chaotic layers. For very high relative masses, the primary regular island restores and dominates the 
plot again. However, if the particles could not get close to the ring (if the latter did not lie within 
their accessible region), their motion would be considerably less irregular. The figure is adopted 
from [1] 



are called RR, DET, DIV and V ENTROPY (see [14]). The recurrence rate RR is 
given by ratio of the recurrence points (black ones) within all points of the recurrence 
matrix, 

1 ^ 

^ 

i,jM 
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Fig. 2 (continued) 



The DIV (“divergence”) quantifier is just reciprocal of the length of the recurrence- 
matrix longest diagonal. The DET (“determinism”) is a ratio of the points which 
form a diagonal line longer than a certain /min within all recurrence points, 



DET(e) = 



— fnun ^ 



where P(e, /) denotes histogram of lengths of the diagonal lines (length spectrum 
of the diagonals). Einally, the V ENTROPY represents the Shannon entropy of the 
probability p(v) that a vertical line has length v. 
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Fig. 3 Example of a regular {left) and chaotic {right) geodesic, both with 8 = 0.955 and f = 3 .75M, 
in the field of a Schwarzschild black hole surrounded by the inverted 1st Morgan-Morgan disc with 
M = 0.5M, raise = 20M. From top to bottom, the rows show their spatial tracks, Poincare sections 
z = 0 (r, u^), time series of the z position and corresponding power spectra. Coordinates are in [M], 
frequencies in [1/M]. Adopted from [1] 
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Fig. 4 Example of a “sticky” {left column) and strongly chaotic {right column) phases of one and 
the same geodesic (having E = 0.956 and i = AM) in the field of a black hole plus an inverted 1st 
Morgan-Morgan disc (with M. = 1.3M, rdisc = 20M): Poincare sections z = 0 {r,u^), time series 
of u^ and power spectra of the z position. Adopted from [7]. Recurrence plots for the same two 
orbital sections. Considering the recurrence-matrix symmetry, we give just halves of the plots in 
one square: the weakly chaotic section is above the main diagonal, while the strongly chaotic one 
is below the main diagonal. The axis values indicate proper time in units of M. Adopted from [7] 



N 

V ENTROPY = - ^ p{e, v) In p(e, v ) , 



P(e, v) 



P(e, v) 



Z N 

V=Vmm 



P{e, V)' 



In Fig. 7 we went along the t/'" = 0 axis of the Poincare diagrams and plotted the 
value of the above quantifiers for geodesics starting tangentially from the respective 
radii; it represents 470 geodesics in total, starting from radii between 5M and 24M. 
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Fig. 4 (continued) 



As already noted at the end of previous section, the colouring of the Poincare maps 
by the values of the “rough” quantity DIV and its “sophisticated” relative K 2 shows 
that both bring practically the same information. It is also obvious that the quantifiers 
(including DIV) are quite sensitive and can uncover even tiny features of phase space. 

In the last two figures, the quantifiers of geodesic deviation are illustrated and 
their message compared with that provided by previous methods. Three orbits of 
different degrees of chaoticity are analyzed in Fig. 8, first on Poincare sections and 
by computing the maximal Lyapunov exponents, then on power spectra of z{t) and 
on behaviour of the Kaplan-Glass directional autocorrelation in dependence on time 
shift applied to z(t), and finally by computing the FLI and MEGNO indicators. 
All the methods clearly distinguish between the orbits and yield consistent results. 
Figure 9 presents Poincare maps of three different phase-space situations, coloured 
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Fig. 5 The it) behaviour (left column) and power spectra of z(t) evolution (right column) of 
four different sections of the same orbit as in Fig. 4. Chaoticity clearly grows from top to bottom. 
Adopted from [7] 
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Fig. 6 Difference between a geodesic lying deep in a primary regular island {left column) and the 
one living in a chaotic sea {right column), as seen on Poincare diagram {top), z{t) evolution {middle) 
and the Kaplan-Glass averaged autocorrelation parameter A(Ar) {bottom). Both geodesics have 
S = 0.93 and I = 3.75M and the background is determined by a black hole surrounded by a 
Bach-Weyl ring with M = 0.5M and rnng = 20M. Adopted from [7] 



by FLI(rmax) and MEGNO(rmax)- The quantities apparently provide almost the same 
message, but MEGNO is somewhat more helpful, because, due to its definite value 
of 2 for regular orbits, it is more precise in distinguishing them from the chaotic 
ones (whose MEGNO value is bigger); to benefit from this graphically, one can, for 
example, increase all the values of MEGNO indicating chaotic evolution by some 
constant (we did it with all values above 4, in order to be on the safe side). 
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Fig. 7 Examples of quantifiers extracted from the recurrence matrix, as computed for 470 geodesics 
launched with E = 0.9532 , 1 = 3.75M tangentially (with = 0) from radii between 5M and 24M 
from equatorial plane of a black hole (M) surrounded by the inverted 1st MM disc {M = 0.5M, 
r^isc = 18M). The orbits were followed for about 250,000M of proper time with “sampling” 
Ar = 45M. In Poincare diagram {top), they are coloured according to their DIV (left) and K 2 
{right) values (log scale). The quantifiers shown (see text for description) are clearly sensitive to 
tiny phase-space features (behaviour along the top-diagram r-axis). Adopted from [7] 
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Fig. 8 Poincare sections (left) and maximal Lyapunov exponents {right) for three orbits (D, E, F) of 
different degrees of irregularity, with parameters Af = 0.94M, r^isc = 20M, S = 0.947, t = AM in the 
upper row (D); Ai = 13M, r^isc = 20M, S = 0.9365, t = AM in the middle row (E); and = 1.3M, 
^disc = 20M, S = 0.941, i = AM in the bottom row (F). The orbits fill phase-space layers of different 
volumes, in agreement with the obtained values Amax(^max) = 9.88- 10“^M“^ {top), 4.28- 10“^M“^ 
{middle) and 2.25 • 10“^ {bottom). The Poincare- surface passages are coloured by proper time 

(it increases in the order blue green yellow red). Power spectra of z{t) {left column) 
and the Kaplan-Glass average of the recursion directions A {right column), plotted for the same 
three orbits D, E, F (for their parts up to Tmax = lO^M). The A(Ar) dependence is drawn for three 
separate time-shift (Ar) intervals, (2500-3300)M, (25,000-25, 800)M and (100,000-100,800)M. 
Note that vertical-axes ranges are different on the right, 0.5-1. 0 for the top orbit whereas 0. 1-0.6 
for the remaining two. FLI(r) on the left and Y (r) (MEGNO) on the right, computed for the same 
three orbits (D, E, F). The average MEGNO 2Y (r) is drawn in blue and its linear fit is in red. The 
maximal-Lyapunov-exponent values inferred from the average-MEGNO slope are 9.172- 10“^ M~ ^ 
for orbit D {bottom), 4.192 • 10“^M“^ for orbit E {middle) and 2.278 • 10“^M“^ for orbit F {top). 
All three plots are adopted from [8] 
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Fig. 8 (continued) 
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Fig. 9 Equatorial Poincare maps of three phase portraits, coloured according to FLI(rmax) {left 
column) and F(rniax) {right column) values, where Tmax = 250,000M. Parameters: M = M/2, 
raise = 1 8M, ^ = 0.9532, f = 3 .75M in the 1st row, M = 13M, raise = 20M, S = 0.9365, f = 4M in 
the 2nd row, and M. = 13M, raise = 20M, S = 0.94 l,t = AM in the 3rd row. Whenever Y (rmax) > A, 
we add another 200 to it in order to enhance distinction between regular and weakly chaotic regions. 
The colours going from blue to red in the visible- spectrum order correspond to FLI increasing from 
0 to 350 {left) and to MEGNO increasing from 0 to 500. Adopted from [8] 




584 



O. Semerak and R Sukova 



5 Concluding Remarks 



In the Introduction, we indicated black-hole dominated cores of galactic nuclei as a 
natural astrophysical motivation. Indeed, the individual stars there can be approxi- 
mated as point test particles and if the environment is sufficiently sparse, their motion 
is close to a geodesic one. In our Galaxy, for instance, there is a black hole of mass 
M = 4.3 • 10^ Mq, probably surrounded by a rather tiny accretion structure and by 
two much larger circum-nuclear rings, one with radius 2 parsecs and mass M/10 
and the other around the 80 parsec radius with mass lOM. (These are average values 
learned from literature, see [8] for references.) There is also a considerable nuclear 
star cluster which could be incorporated by adding a suitable spherical term in our 
potential u. Of the above structures, only the smaller circum-nuclear ring has been 
observed to be able to partially destabilize the motion of stars, and only if their orbits 
can get sufficiently close to it. 

The dynamics of astrophysical systems clearly has observable consequences con- 
veyed by electromagnetic as well as gravitational radiation. The character of dynam- 
ics is crucial for the long-term evolution of these systems, although examples like 
the motion of stars in galactic nuclei are happening over too long intervals to provide 
well detectable signatures of chaos in “real time”. The characteristic periods are of 
course the shorter the closer to the horizon the motion takes place and [25] argued that 
it could be possible to recognize whether the central object is Kerr- like or different 
from the radiation of an inspiralling captured geodesic orbiter. Without doubt, the 
astrophysical black holes should differ from the Kerr ideal for other reasons (than 
considered here) as well: they must be interacting (if only to be “observable”), so not 
only they are not isolated, but also not stationary; and probably they do not live in an 
asymptotically fiat universe. In any case, the study of motion in deformed black-hole 
fields can lead to fundamental questions concerning the nature of objects supposed 
to play a key role in the most engaging cosmic systems. 

Let us conclude by several suggestions of further possible work. One could cer- 
tainly subject the system to still another methods and codes in order to check the 
results, but also to evaluate the methods/codes themselves and their practical features 
within general relativity; such analyses like [24] (on the usage of several indicators of 
orbital deviation in GR) will be helpful. For example, the Melnikov-integral method 
has already been employed several times in general relativity, as well as the study 
of the dimension of “basin boundaries” (boundaries between initial-condition sets 
which evolve to distinct end states), which are of particular appeal due to their 
coordinate-independent message. It would be interesting to perform a similar analy- 
sis for the “corresponding” (pseudo-)Newtonian or post-Newtonian systems, at least 
to infer how good approximations they provide. Also, for better insight, one should 
study in detail the unstable periodic orbits and their asymptotic orbits whose behav- 
iour under perturbation is crucial for the system properties. 

The other direction is to make the model of the astrophysical system more real- 
istic. Speaking about the galactic nucleus, one could include the central star cluster 
(at least in terms of a spherical potential), replace the thin ring by a toroid of finite 
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cross-section, or/and possibly try to account for mechanical interaction of the orbiters 
with the putative gaseous medium. From the point of view of general relativity, 
it would however be most interesting to include rotation (which generates drag- 
ging effects, no longer compatible with static solution). Actually, rotation is almost 
omnipresent in astrophysics and mainly expected in the case of very compact bodies 
like black holes, and certainly in the case of orbiting matter like that of accretion discs. 
Unfortunately, it has proved very difficult to extend the scope from static to station- 
ary axisymmetric exact setting. It is well known that under such symmetries Einstein 
equations reduce to the Ernst equation which is completely integrable, but explicit 
solutions of the corresponding boundary-value problem are rather involved [26-28] 
and not friendly as backgrounds for further extensive numerical studies. Although 
several related algorithms called generating techniques have been recognized which 
can in principle provide any solution with given (two commuting) symmetries (e.g. 
[29]), none of those actually considered in more detail turned out to have satisfac- 
tory properties (besides basic ones already familiar before, like the Kerr metric). In 
particular, simple attempts to “generate” a physically reasonable metric for a black 
hole surrounded by an additional matter or field have not been very successful (e.g. 
[30-34] and references therein). 

To conclude with the main topic of this seminar, let us also notice that in reality the 
body whose motion is studied differs from the point-test-particle ideal (and thus the 
motion from a geodesic one): it is likely endowed with spin or even higher moments; 
the radiation reaction due to its gravitational emission could be taken into account as 
well as back reaction due to its non-zero effect on space-time geometry; and, needless 
to say, if the orbiter was charged, it would also be affected by electromagnetic field, 
if there was any around. It would be sensible to employ the methods used in the 
theory of chaos in order to estimate and compare the significance of various such 
“perturbations” which affect the motion in real astrophysical situations. 
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Higher Order Post-Newtonian Dynamics 
of Compact Binary Systems 
in Hamiltonian Form 



Gerhard Schafer 



Abstract The Hamiltonian formalism developed by Amowitt, Deser, and Misner 
(ADM) is used to derive and discuss the higher order post-Newtonian dynamics and 
motion of compact binary systems in general relativity including proper rotation 
of the components. Various explicit analytic Hamiltonians will be presented for 
the conservative and dissipative dynamics, the latter resulting from gravitational 
radiation damping. Explicit analytic expressions for the orbital motion will be given. 



1 Introduction 

The convincing explanation of the anomalous perihelion advance of Mercury, the 
first success of general relativity, has been right a post-Newtonian (PN) achievement. 
So PN approximation techniques are part of the application of the Einstein theory 
from the very beginning. Since 1915, the year of the birth of general relativity, a 
lot of work has been invested into the exploration of the theory in terms of PN 
approximation schemes. Typical for all these PN schemes is the Newtonian theory 
as starting point and the expansion of the structure of general relativity, in the weak- 
field and slow-motion regime, in inverse powers of the speed of light c in the form 
of terms containing powers of jc^ and Gmjrc^, where v and m respectively 
denote a typical velocity and a typical mass of the bodies in question and r a typical 
relative distance; G is the Newtonian gravitational constant, e.g. [1-3]. The nPN 
order of approximation means the nth power I . Eor bound systems, the virial 
terms guarantees v^/c^ ^ Gmjrc^ . Starting with the level where radiation reaction 
enters for the first time, half-integer n enter. The first radiation-reaction level is of 
2.5PN order. If the internal dynamics of a body is decoupled from its external one, 
the internal dynamics must not have weak gravitational fields to participate in the 
external weak-field and slow-motion dynamics; so neutron stars and black holes can 
be objects of PN systems. Eamous is the derivation of the IPN dynamics of n-body 
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systems with strong self interaction published by Einstein, Infeld, and Hoffmann 
(EIH) in 1938, [4]. Here the technique of closed surface integrals around the bodies 
has been applied, themselves located in the weak field regime. Thinking in terms of 
Gauss’s theorem it is understandable why with the help of Dirac delta functions the 
same result could have been achieved later on; however, with much less amount of 
computational work. The interested reader may compare the two different techniques 
in the recent computations of the 3PN binary dynamics, where Itoh and Eutamase 
made use out of an EIH-type approach [5, 6] whereas Damour et al. [7] and Blanchet 
et al. [8] applied the Dirac delta function ansatz. Also a recent rederivation of the 3PN 
binary dynamics by Eoffa and Sturani [9] within the effective field theory formalism 
used the Dirac delta function or its Eourier transform. 

The Hamiltonian formalism of general relativity has been inaugurated in two 
seminal papers by Dirac in the years 1958-1959, [10, 11], and developed in great 
detail by Arnowitt, Deser, and Misner (ADM) in the years 1959-1960, see review 
[12, 13]. Also Schwinger devised a Hamiltonian formalism based on the ADM 
formalism but with applying tetrads in a first-order formulation of general relativity, 
in 1963, [14]. Already in the same year. Kibble implemented the Dirac field into 
the Schwinger formalism, [15], which, in the realm of the Dirac approach, [16], was 
fully worked through by Nelson and Teitelboim in 1978 only, [17]. DeWitt, in 1967, 
[18], and Regge and Teitelboim, in 1974, [19], delivered refined treatments of the 
Hamiltonian formulations of general relativity with respect to the involved surface 
integrals, particularly the famous Hamiltonian one of ADM. 

The ADM formalism of general relativity is the most often applied one out of these 
formalisms. Its application to the PN approximation already started with Kimura in 
1961 who rederived the IPN dynamics of EIH, [20]. In the 1970 s a lot of work has 
been performed in Kimura’ s research group for the obtention of the 2PN dynam- 
ics, particularly in the ADM formalism [21]. The full set of expressions of the 2PN 
dynamics in Hamiltonian form was finally obtained by Damour and the author in 
1985, [22]. Later on in 2001, by Damour, Jaranowski, and the author, for the first 
time, the 3PN binary dynamics has been computed and finally, most recently, the 
4PN compact binary dynamics, where in both cases judicious dimensional regular- 
ization procedures were applied, [7, 23]. The paper by Damour, Jaranowski, and the 
author about the Hamiltonian of two spinning compact bodies with next-to-leading 
order gravitational spin-orbit coupling, published in 2008, can be regarded as the 
beginning of the PN treatment of gravitationally interacting spinning bodies in gen- 
uine Hamiltonian form, [24]. Already in 2009, a seminal paper appeared where the 
implementation of classical spinning bodies into general relativity was performed 
to linear order in spin, [25]; a detailed presentation is given in [26]. The state of 
the affairs in the derivation of higher order gravitational Hamiltonians with spinning 
compact objects can be found in [27] . Our list of References mainly contains papers of 
Hamiltonian provenance. In the lists of References presented therein, related papers 
applying non-Hamiltonian approaches can be found. 
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2 Newtonian Dynamics in Hamiltonian Form 

A full set of mathematical relations for a Newtonian ideal fluid is given by the 
equations, e.g. [28], the Einstein summation notation applied, (i) 



dtQ^ + = 0 , ( 1 ) 

the mass conservation equation, where denotes the mass density and the velocity 

field of the fluid, (ii) 



g^dtv^ djig^^v^yj) = -dtp g^^diU, ( 2 ) 

the equations of motion, where p is the pressure of the fluid and U the Newtonian 
gravitational potential, (iii) 



— = dts v^diS = 0, (3) 

at 

the entropy conservation equation for the mass elements along their motion in space, 
where 5’ denotes the specific entropy, (iv) 



e = e{g^,s), 

the equation of state, where e is the specific internal energy, (v) 



de — ^dg^ + Tds, 



the Gibbs fundamental form, with T being the temperature, (vi) 



U(x, t) = G d^x 



J 



3 



X - X' 



the gravitational potential expression in terms of its source, or 



( 4 ) 

( 5 ) 

( 6 ) 



AU = -47tGp*, (7) 

the Newtonian gravitational field equation. A denotes the Laplace operator and |...| 
the standard euclidean distance. Partial derivatives with respect time and space, 
employing cartesian coordinates x = (x^), (i = 1, 2, 3), are denoted dt and 9/, 
respectively. 

In a Hamiltonian formulation, e.g. [28], all the dynamical equations are derived 
with the aid of a Hamiltonian functional H = H[g^,7Ti,s] which depends on such a 
set of variables that the equations of motion follow by variations. The new variable 
herein is the momentum density tt/. It relates to the velocity field through = ^. 
The dynamical equations presented above, are now. 
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(i) mass conservation, 



17 = -®'!- 



(S-) 



(ii) equations of motion, 



dni 

dt 






(iii) entropy conservation, 



ds SH ^ 



( 8 ) 



(9) 



( 10 ) 



In terms of generalized brackets, the so-called Lie-Poisson brackets, e.g. [28], 
equations of motion generally read 



dtA(x, t) = {A(x, 0, fi). 



( 11 ) 



In our case, the fundamental variables fulfil the following Lie-Poisson-bracket 
relations 



{7r, (x, f), £>*(x',0) = ^[£>*(x', t)(5(x-x')], 

, 5s (x', t) ^ , 

{7r,(x, 0,s(x,0) = S(x-x), 

ox'’ 



( 12 ) 

(13) 



{7Ti(x, t), 7Tj(x', t)} = TTi(x', t)^yjS(x - x') - TTj{x, t)-^5{x - x'), (14) 

dx'J dx^ 

where / cf’x S(x) = 1. 

The introduction of Lagrangian variables q^(x,t) and Pa(x, 0. with A = 1, 2, 3, 
turn the Lie-Poisson brackets into standard Poisson brackets, reading 



{^"^(x, t), Pb(^\ t)] = S^5(x — x ') , zero otherwise. 



(15) 



where the following relations between the Eulerian and Lagrangian variables hold, 

' dq^{x, t) 



Q^ = QL[q (x, OJdet 



dx^ 



(16) 



^(x, t) = siVq 0], 



(17) 



Higher Order Post-Newtonian Dynamics of Compact . . . 



591 



dq^{x, t) 

7Ti(x, t) = Pa(x, t) . 



(18) 



As each fluid matter element keeps its Lagrangian “name”, say A or B, the Lagrangian 
position held fulfllls a conservation equation, 



d,q'^ + v^diq^ = 0 . 

Finally, the Hamiltonian functional of our ideal fluid is 






~ 2 



t 3 r3 



H = - I — — I + J d^x g^e. 



2 J p* 

The total linear momentum of the fluid is given by 

Pi = J d^xnt, 

its total orbital angular momentum reads, 
and the center-of-mass vector takes the form. 






Gj = I d^xx^g^. 



The transition to point particles is achieved through the settings 

^ ^g^(x — Xq), 7T/ = ^ Pai6{x — X^), 
a a 

which result in standard Poisson brackets, 

{v^, paj} = Sfj , zero otherwise, 

and in the standard Hamiltonian of gravitationally interacting point masses, 

^ PaiPai 1 GmaMb 

• ^ 9 • ^ Iy„ — I 



(19) 



( 20 ) 



( 21 ) 



( 22 ) 



(23) 



(24) 



(25) 



(26) 



a^b 



with mass parameters and linear momenta Pa, = 1, 2, 3, ..., A), the number 
of point masses being denoted by N. Dusty matter would have been obtained by 
imposing p = e = s = 0. 
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Evidently, the total dynamical quantities of above read, 



Pi = '^Pai, 



(27) 



a 




(28) 



a 



and 



Ki — Gi t Pi , Gi — ^ MgX ^ , 



(29) 



a 



where the constant boost vector Kt has been introduced. 

Independently from its structure, any isolated system has to fulfil the well known 
Galileo algebra 



where M is the total mass, M = nia . Notice in view of the following section that 

the components of the center-of-mass vector commute in Poisson brackets and that 
the total mass breaks the otherwise uniform appearance of the dynamical quantities. 



3 Relativistic Systems in External Gravitational Fields 
in Hamiltonian Form 

The energy-momentum tensor of an ideal fluid takes the form, e.g. [3, 28], 

+ £>e + p)uijU^ -\- pS^, = -1, = gvnU^, (36) 

where g is the proper rest mass density of the fluid and u'^ its 4-dimensional timelike 
velocity field with its 3-dimensional components reading u' = u^v‘ /c. The space- 
time metric is (p, v = 0, 1, 2, 3), and denotes the standard Kronecker delta, 
e and p have the same meaning as in the previous section which is also the case for 
s and T, 



{Pi,H] = {Ji,H] = Q, 
{Jii Pj] — ^ijk Pkt {Jiy Jj] — ^ijk Jk^ 
{•7/ , G j] — ^ijk 
{Gi, H} = Pi , 
[Gi, Pj} = M5ij, 
[Gi,Gj} = 0, 



(30) 

(31) 

(32) 

(33) 

(34) 

(35) 



e = e{g, s), 



(37) 



Higher Order Post-Newtonian Dynamics of Compact . . . 



593 



de = ^dg + Tds. 

The independent variables are defined by, = det(^^^y)), 
= s, 7T/ = 






(38) 



(39) 



They do have the same Lie-Pois son-bracket structure as in the Newtonian dynamics 
case, 



d 

{7T/(x, 0, ^*(x', 0} = 

, ds(jd , t) ^ , 

{7T/(x, 0, ^(x , 0} = ^ d(x - X ), 

ox'^ 



(40) 

(41) 



{7T/(X, t), TTjix', 0} = TTiix', t)^jj6(x - x') - 7T j (x, t)^6(x - x'). (42) 

dx^J ax^ 

Also the equations of motion do have the same form as in the Newtonian case, 



dg* 

dt 




1 = 0, (43) 


ds 




u>^df,s = 0, (44) 


dt 


= 

OTTi 


= -dj 1 


(SH \ 




Wn ■ ‘ 






V/. {yr"gTt) = 0, (45) 



Qtt,- 



with V„ the 4-dimensional covariant derivative, where again 



V = 



6H ; u' 

= c — 

Sirt^ mO 



hold. The total linear and orbital momenta read 

Pi = J‘ d X ITi, Ji = j d X CijkX'^ TTk- 

Dusty matter is again obtained through 



(46) 



(47) 



e = p = s = 0, 



(48) 
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and the transition to point particles is made by 

^ i 

Q* = '^maSix - Xa), 7 T/ = ^ Pai^(^ ~ X«), ^ ( 49 ) 

a 

with standard Poisson brackets, 

Paj) = Sij, 

and standard equations of motion, 

dpai ^ dH 
dt ’ 

The Poincare algebra, reading, e.g. [2 

{Pi, 

{di, Pj} — ^ijk Pk, 

{G/, 

generalizes the Galileo algebra to relativity. The boost vector Ki fulfills the relations 
Ki = -t Pi + G/, dKi/dt = dKi/dt + {Ki, H] = -Pi + {G/, H] = 0. 

Canonical expressions for a non-interacting particle are, setting c = 1, the 

total angular momentum: J = X x P + S, 

Hamiltonian: H = \/m^ -h P^, 

Lorentz boost: K = — tP -h HX — x P, 

center-of-energy position vector: X = X — S x P. 

The constant boost vector can be written: K = — tP -h HX, G = HX. 

For isolated conservative many particle systems with interaction one gets, using 
again standard notations = (pai), etc., for the 



a 



zero otherwise. 


(50) 


II 


(51) 


' J 9 

H] = [Jt,H} = 0, 


(52) 


{di , dj} = £ijk dk, 


(53) 


[di, G j} = Si j]^ G^, 


(54) 


{Gi,H} = Pi, 
h,Pj} = 2H5ij. 

G ^ ^ijk dk’) 


(55) 


(56) 


(57) 
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total linear momentum: P = 

total angular mommentum: J = x Pa + 

total rest mass squared: —P^ , H = \/ + P^, 

center-of-energy vector: G = (J — X x P) x P. 

The standard Poisson brackets read, 

{X\ xj} = {P\ pj] = 0, {X\ pj] = 

{M, xj} = {M, pj} = {M, H} = 0. 

There are various center-of-body definitions in use, the 

(i) , center of spin: X; [X ^ , X^ } = 0 (Newton- Wigner coordinates), 

(ii) , center of energy: X = X - ^ x P, 

(iii) , center of inertia: X = X -h S x P. 

Connected herewith are spin supplementary conditions (SSC), for 

(i) , center of inertia: Pjj = 0, 

(ii) , center of energy: S^^rijy = 0, ^^ = (—1,0, 0, 0), 

(iii) , center of spin: Pj^ = 0, 

where, say, 5'^^ = —5'^^ and S^j = pj^S^ . The various center definitions are 
intimately connected with the fact that the components of the center-of-energy vector 
have non- vanishing Poisson brackets; also see [2]. 

4 General Relativity in Hamiltonian Form 

Following the work by ADM [12, 13], general relativity can be put into a non- 
degenerate canonical form if e.g. the following four coordinate conditions are 
applied. 




( 58 ) 
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which mean three coordinate conditions, where hJJ is transverse and traceless, i.e. 
= 0 ’ (g = detigij)), 

7t" = 0, tt'-' = K), tt\ = gijTr‘^ = hJJ , (59) 

which means one coordinate condition. Kfj is the extrinsic curvature tensor of hyper- 
surfaces t = const (K = gijK^j) and Y^gkj = ^ unique decomposition is 

possible, reading -h tt-^j with 

= diTi^ + djTi^ - ^5ijdk7T^. (60) 



Again, tTjj is transverse and traceless. The canonical conjugate field to hj^ is denoted 

/16ttG. These both fields are the independent canonical variables of the grav- 
itational field. The field functions 0 and are determined by the four constraint 
equations. 



> ^ ^ z + y 



PaiPaj] 



(61) 



the Hamiltonian constraint, which is equivalent to the Einstein field equation 
gOO = 



(62) 



the momentum constraint, which corresponds to the Einstein field equations G? 

SttG y'O 

y i 

The famous ADM Hamiltonian is defined by 



H 



[x‘a< 



Pai 






-Al = f 

167tG J 



d^x A0 



Xa,Pai,h]J,Tr: 



'TT 



(63) 



which also can be brought into a surface integral form using the Gauss theorem. 

The description of the conservative dynamics part is most efficiently performed 
with a Routh functional, e.g. [30], 



R 



x^, Pai, hJl,dth 



r.TT 



= H - 



167tG 



/' 






(64) 



resulting in the following evolution equations. 
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5 f R{t')dt' _ . _ dR .i _ OR 

5hJf{xKt) “ ’ 



(65) 



The on-field- shell Routhian describes the conservative dynamics, 



^on(0 — ^ 



4’ Pai^ Pak], dthjful, Pak] 



( 66 ) 



. ,, 5 f Ron(t')dt' ■ 5 f Ron(t')dt' 

Pai(t) = —rj77^ 4(0 = 



<^4(0 



Spai(t) 



(67) 



6 f Ron(t')dt' _ dRon d dRo 



+ Z = (x‘ Pai). 



5z{t) dz{t) dt dz{t) 

Within the present context, the fundamental PN expansion reads 



R 



\x'^,Pai,hJ^, 



dfh 



= Mc^ + 



n=0 ^ ' 



( 68 ) 



x‘a^ Pai,h]j,dthjj'^, (69) 



in the form (symbolic writing). 



where hj^j = ^h-j . Hereof the PN expansion of the field evolution equation follows 



^ ^ m =0 ^ ' 



Vx,Xa{t),Pa(t).h^(t).dth^\t)-\. (70) 



XT/ 



The order c is called the nth PN order. Also the multipole expansion of the 
gravitational field is of PN type, e.g. in far zone (FZ), [31], 



/XT/ ^ Pijkm 0 



“§l(a 



^ 1 4 






13.. .11 



+ 



(c^) (/ + 1)! 0 C ) 



(71) 



where the mass quadrupole tensor is given by, showing the standard Newtonian mass 
quadrupole tensor Mij and the leading order backscattering or tail contribution, [32], 
M denoting the total mass and h a scaling constant. 



Mij(t --)= M,- 



('- 7 ) 
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+ 



2GM 

^ Jo 



2GM 
r* = r H ^ 



dv In + 0{\/c^), 

ln(^) + Od/c’)^ 



(72) 



Through second PN order in the wave emission, i.e. two PN orders beyond the leading 
order or, symbolically, order c“^(l + c~^ + c~^), the time-averaged gravitational 
wave luminosity < C{t) > and energy loss < ^ > are given by. 



C{t) = 



327tG 



FZ 






do. 



(73) 



and 









1 



IJ IJ 






16, 



(3)q(3) 



4 



-F— 



(4)q(4) 



9072 ^J^^ ^J^^ 84 



(74) 



dE{t-r^/c) 

— < > = < C{t) > . 

dt 



(75) 



The leading term is known as the famous “Einstein quadrupole formula”. 



5 Spin and Gravity 

The implementation into gravity of proper angular momentum (spin) of a classical 
or quantum particle is most efficiently performed with the aid of a tetrad field 
e.g. [14-16], 

= r]ab , eauebvT]^^ = (76) 

where rjab = denotes the Minkowski metric, a, b = (0), (1), (2), (3), and 
the metric of physical spacetime. A local Lorentz transformation operates in the 
form 



eT = L\e>i, L“^VabL‘’,=Vcd. 



(77) 
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A linear connection is needed for the spacetime derivatives to transform 
homogeneously under local Lorentz transformations, e.g. [33], 



D^4> = d^4> + -wfG[ab]4> 



(78) 



with 



= + (79) 

Infinitesimal local Lorentz transformations with group parameters induce the 
change 



8(j) = 



(80) 



on the mathematical object (p. 

The curvature tensor of the tetrad field, is defined by 



D^Dy(j) - Dj,D/^(t) = R^^^^G[ab](j) , 



( 81 ) 



and results in 



r>ab f) , ,ab o , ,ab ■ , ,ac , ,bd^ , ,ac , ,bd^ /qo\ 

^ HU = Vcd - Vcd ■ (82) 

The Lagrangian density of the gravitational field is given by 

Cg = yLdet(e^)e^4f?"^^(a;) + . (83) 

The vacuum Einstein field equations take the forms, 

5 G 1 

0 = -J^eau ^ 2det(ep(/?^^ - -g^^R) (84) 

and 

^ (85) 

[I 

The latter equation eliminates torsion from the tetrad field. If this condition is also 
kept if matter sources for the gravitational field are included, no torsion will enter 
the gravitational field. 

The Lagrangian for spinning objects coupled to gravity reads, e.g. [26] and 
References therein. 
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-Cm 




dz>^ \ 



56'^’’' 

dr 



%)> 



( 86 ) 



where Sab is the spin tensor in tetrad components and 56‘^^ /dr the angular velocity 
tensor. This Lagrangian is degenerate and thus needs an additional Lagrangian which 
imposes supplementary conditions, 



Cc 




^iP^Sab + Pa 



2 



(p^ + m^) 



%), 



( 87 ) 



where , A 2 [/] , A 3 are Lagrange multipliers, pa = Pab P^ the four-dimensional linear 
momentum of the particle with p^ = PaP^, and the rotational matrix between 
the spacetime tetrad ea and a body fixed one; the latter tetrad has capital latin indices 
as names, C = [ 0 ], [/], whereas the former one has small latin ones, a = (0), (/), 

60 ^^ = = - 60 ^^. ( 88 ) 

Obviously, S9‘^^ is not an exact one-form. 

The equations of motion which originate from our Lagrangians read. 



J^^ab 

Dt 



= 0 , 



( 89 ) 



^ = -\R^^pabuPS‘^K ( 90 ) 

Dt 2 

or, in well known spacetime component representation, 

_^= 0 , ( 91 ) 



— 2 ’ Rupap — ^a^^f)Rb-pab^ 



( 92 ) 



where 



n dZ^ n 

= — = Mp^ ’ 
dr 

The energy-momentum tensor T of the spinning body has the form 



( 93 ) 



= j 



dr 



Xj,pl^p''S(4.) + 






( 94 ) 
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where (5(4) is the four-dimensional delta function and r the proper time of the body. 
This tensor is the well known pole-dipole energy-momentum tensor by Tulczyjew 
[34]. 

As we are interested in a canonical formulation of the gravitational spin dynamics, 
the line element of the spacetime metric has to be put into the ADM representation, 

ds^ = —{Ncdt)^ -h gij(dx^ -h N^cdt)(dx^ -h N-^ cdt), (95) 

where N and denote the lapse and shift functions, respectively, coined so by 
J. A. Wheeler in 1964, [35]. 

The non-reduced Hamiltonian has the general form, ( / = di), 

H= j cPxiNH - N‘Hi) + £ d^Si (gijj - gjjj), (96) 

where Ti and Hi are the super-Hamiltonian and super-momentum, respectively. The 
surface integral is taken at spatial infinity, denoted At the lapse and shift 
functions are assumed to behave as 



A^l .0 = 1 + 0(1/ r ) , 1^-0 = 0(l/r). (97) 

The famous constraint equations 



n = 0 and Hi = 0 (98) 

result from the fact that the functions N and are Lagrange multipliers. 

If the constraint equations are fulfilled and adapted coordinate conditions are 
applied, then 



^4 n 

H = d^Siigijj - gjjj) = Hadm (99) 

describes the Hamilton functional of the total system, called the ADM Hamiltonian. 

To simplify notation, from now on and for the rest of this section we put 
c = G = 1 . The solutions for the matter supplementary conditions read, for 
(i), Ai: 



(ii), A 2 : 



nSi = n^Sui = 



np 



gijnSJ, 



yvW(O) = 



P(i) 

p(0)’ 



r,a 

^[0]« ^ _E_ 



( 100 ) 



m 



( 101 ) 
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(iii), A3: 

np = n>^pn = -yjm^ pipj, = 5), (102) 

where the timelike unit vector orthogonal to the t = const slices reads, 

= (1, -N^)/N, = (-N, 0, 0, 0). (103) 

The time gauge for the tetrad field, coined so by Schwinger [14], is defined by, 
also see [16], 



^(0) ~ ^ 



— - - 



■^(0) 



1 

V’ 



^(0) - ■ 



AT* 

V 



(104) 



thus. 



gij=ef''^e(^)j. (105) 

The matter part of the constraint Lagrangian density takes the form 

Cmc = (106) 

with 

^matter ^ _ j^ij Pi!^g _ (107) 

np 



and 



nf 



= (Pi + KijuShS + ( . (108) 

\2 ' np 



Here, a semicolon denotes three-dimensional covariant derivative. 

The transformation to canonical matter variables, , Sij , Pf, reads 



- 



nS^ 

m — np' 



nSi = - 



Pkl^^s 



Jt 



^ij — 



PinSj ^ pjnSi 



m — np m — np 
nU) 



xmu) = x^iW)L ^ 

\ m{m — np) ) 



(109) 

( 110 ) 
( 111 ) 
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Pi — Pi + KijuS^ + A ^{j)k^i^i np 

kj 

where F ■ are 3 -dimensional Christoffel symbols and 

SikSjlA — Si j . 

2 np{m — np) 

The spin tensors and vectors have constant three-dimensional length, 
S Sab — ^{i)U)^(i)U) ~ — COnst, 






and the canonical three-dimensional rotation matrix is orthogonal, 

AgA'«»=i,, 

also, it holds. 

The purely kinetic Lagrangian density of the body takes the form, 

^KM = PiZ^ ^ + -S(i)(j)¥^^^-^^6 
and a gravitational Lagrangian related with the body reads. 



^GM = ^{k)iefJ^S. 

Adding the Lagrangian density of gravity, £g, yields, 

Cgm +Cg = S]e(k)iefl + Cgc ~ ’ 

where Si = gijj — gjjj and the total energy is given by 

The gravitational constraint Lagrangian density is 
Cqc = 



(112) 



(113) 



(114) 



(115) 



(116) 



(117) 



(118) 



(119) 



( 120 ) 



( 121 ) 
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where 



n 



field 



167Ty^ 



- gij 



gijgklT^'^T^^^ 



with R the 3 -dimensional curvature scalar, and 



1 



-T/field ^ jk 

n, =^sij^-x 



( 122 ) 



(123) 



The pure field momentum reads, 

tt'-' = Vg(7'-'7^^ - 7'^7-'^)^W- (124) 

The spatially symmetric time gauge, introduced by Kibble in 1963, [15], is defined 



by 



^{i)j — ') — §ik Or ? — \/ iSkl)- 



(125) 



It allows the elimination of the tetrad field from the formalism (otherwise the Einstein 
theory of gravity could not be fully regained). 



~ ^ij Skl,ii + 



where Bfj = = -Bf.. 

The new canonical field momentum is then given by, cf. [15], 



-h 167T 6 



(126) 



(127) 



Additionally, for the obtention of unique dynamical evolution also appropriate 
spacetime coordinate conditions must be imposed, in the ADM case. 



^SijJ 8jj,i — 0 ’ 



<an = 0 



(128) 



or. 



gij — + h] 



jr^j — 7T^7 -U 
'‘can — — 



TTcan’ 



Again, transverse traceless conditions apply. 



(129) 



/,TT _ ii _ ^TT _ _ Q 

^TTcan ~ ^ijj ~ ^TTcan,; 



( 130 ) 
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as well as longitudinal traceless ones, 




(131) 



The constraint equations read, 






^matter 



= 0 , + n" 



^matter q 



(132) 



Finally, the total canonical action results in the form. 




and the Hamiltonian reads 




The Poisson brackets take the particularly simple form. 



{z\ Pj} — Sij, [S{i), S(j)} — QjkS{k), 



(135) 



t), TT^canN. 0} = S(X - x'), 



(136) 



and zero otherwise, where i5™^(5(x) is a transverse traceless Dirac delta function, 
e.g. [12, 13], 



6 Higher Order Post-Newtonian Hamiltonians 

In the following, reduced variables in the the center-of-mass system will be used, 
defining = (H — Mc^)/ fi, fi = m\m 2 / M ,M = m\-\-m 2 ,y = yu/M, 0 <v< 1/4 
with test-body case z/ = 0 and equal-mass case u = 1/4. In the center-of-mass 
system pi + P2 = 0 and p = Pi//i. Furthermore, = (n- p), q = (xi — X2)!GM, 

n = q/|q|. 

The 3PN conservative binary black hole dynamics has Hamiltonian, [7, 36], 




( 137 ) 
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where 



Hn = 




(138) 



^[IPN] = 



1)/ 



1 ,,11 
-[(3 + u)p + I'Py.] - + 

2 q 2q^ 



(139) 



^[2PN] = 77(1 - + 5 v^)p^ 

lo 

+ 1[(5 _ 201/ - 3 i/ 2 )/ - 21 / 2 _ 3 j, 2 ^ 4]1 

O ^ 

+ h(5 + 81 /)/ + 3i/p2] J- - 1(1 + 3 i/)2, 

2 4 



(140) 



-^[ 3 PN] = + 35 u^)p^ 



+ T7[(-7 + 42i/ - 53 i/ 2 - Si/^)^® + (2 - 3i/)i/2;/2^^ 
16 



+ 3(1 -;/)z/2p4p2_5^3^6]i 



+ 



2 (-27 + 1361/ + 1091/2)/ + 2(17 + 30i/)i/// 
16 16 

1 



+ 2(5 + 43z/)i// 

12 J 



r/ 25 /I , 335\ 23 ,\ 

[ V 2 ~ ~ Y'" ) 

1 85 3 2 7 \ 2 I 1 



+ 11 - 

+ 



ri /109 21 , 



)2 



(141) 



The dissipative dynamics is non-autonomous with Hamiltonians explicitly depend- 
ing on time, H 2 , 5 m{t) + + .... The leading order 2.5PN binary black hole 

orbital Hamiltonian reads, e.g. [37], 



^^2.5Pn(0 = ^Qijit) (2iZli 
5c^ V 



+ 



PliPlj Gm\m2 i j 



m 2 



r\2 



^ 12^12 



)■ 



(142) 
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where the variables in 



Qij{t) 




(143) 



are allowed to be identified with the variables outside Qij (t) only after the Hamilton 
equations of motion have been derived. 

The spin-gravity interactions of compact objects have the forms, [38-40], to lead- 
ing order spin-orbit coupling, (S = S, p = canonical momentum). 






LO 



SO 



xnafo)- 

Cl b^a cib 



3mb 

Pa - 2pb , 

2ma J 



(144) 



to leading order spin(l)-spin(2) coupling, 

<°s. = ^ZZ — 3 - [3(Sa • ■ XVab) - (Sa ■ S^,)] , (145) 

a b^a^^ab 



and to leading order spin(l)- squared coupling, 



LO ^ [3(s, . ni2)(Si ■ ni2) - (Si ■ Si)] , (146) 

where Cq^ = 1 for black holes and Cq^ > 1 otherwise, e.g. [41, 42]. 

For the next to leading order couplings, the corresponding expressions are, for 
spin-orbit coupling, [24, 43], 



^NLO _ 

^so — 



- ((Pi X Si) • ni 2 ) 



5wi2Pi 3(pi ■ p2) 



8m j 



4m j 



3p2 



'2 3(pi • ni 2 )(p 2 • ni 2 ) 3(p2 ■ 1112 )^ 

' .0 ' 



Am\ni 2 4m J 

+ ((P2 X Si) • ni2) 



2m\m2 



(Pi P 2 ) 3(pi •ni 2 )(p 2 -ni 2 ) ' 



m\m2 



m\m2 



+ ((Pi X Si) -P2) 



2(p2 ■ 1112 ) 


3(pi -ni2)] ■ 


m\m2 


Am\ 



+ 43 


- ((Pi X Si) • ni 2 ) 


llm2 5m? 






2 mi 



+ ((P2 X Si) • ni2) 



6m 1 -h 



15m2 



+ (1 ^ 2 ), 



(147) 



for spin(l)-spin(2) coupling, [44], 
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= (G/2mim2cV^)[3((pi x Si) ■ ni2)((p2 x S2) ■ ni2)/2 
+ 6((P2 X Si) ■ ni2)((pi X S2) • 1112) 

- 15 (Si • ni2)(S2 • ni2)(pi • ni2)(p2 • ni2) 

- 3 (Si • ni2)(S2 • ni2)(pi • P2) + 3 (Si • P2)(S2 • ni2)(pi • 1112) 

+ 3 (S 2 • pi)(Si • ni2)(p2 • ni2) + 3 (Si ■ pi)(S2 • ni2)(p2 • 012) 

+ 3(S2 • P2)(Si • ni2)(pi • 1112) - 3 (Si • S2)(pi • ni2)(p2 • ni2) 

+ (Si • pi)(S2 • P2) - (Si • P2)(S2 • pi )/2 + (Si • S2)(pi • P2)/2] 

+ ( 3 / 2 mir^)[-((pi x Si) ■ ni2)((pi x S2) • 1112) 

+ (Si • S2)(pi • 1112)^ - (Si • ni2)(S2 • pi)(pi • ni2)] 

+ (3/2w2r^)[-((p2 X S2) • ni2)((p2 x Si) • ni2) 

+ (Si • S2)(P2 • ni2)^ - (S2 • ni2)(Si • P2)(P2 • ni2)] 

+ (6G^(mi + m 2 )/cV)[(Si ■ S 2 ) - 2(Si ■ ni 2 )(S 2 ■ 1112 )], (148) 

and for spin-squared coupling, [45, 46], 

+ (Pi • “) (Si • n) (pi ■ Si) 

+ ^ “ 4 + ~ 8 2 ^®') 

+ |^3-^Ce,^(pi-p2)(Si-n)2 

+ ^ - ^ + (P2 • “) (Pi • Si) (Si • n) 

+ ^ - 3 + (Pi • n) (P2 • Si) (Si • n) 

+ Q - (Pi -Si)(p2 - Si) + 0 - (Pi ■") (P2 -n)S^ 

+ 0 I + ™ + ^5 (S. ■ »)" )] 

+ ^[(® + + ^0 + ®<®e,))s; 

- (3 + 5<^ei + ^(i+6Ce,)) (Si-n)2j. 



^s? 



NLO 



(149) 
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This expression goes beyond the general linear-in- spin structure and needed addi- 
tional considerations as undertaken in the quoted References. 

Properly augmenting the discussed Hamiltonians, the present knowledge reads, 



All the higher order dissipative Hamiltonians, including the leading order spin-orbit 
and spin(l)-spin(2) coupling ones, have been computed in the Refs. [47-50]. 



7 Applications 

Important applications of the dynamics of binary systems are the computation of 
their orbital and spin motions. Here particularly, the orbital period, the periastron 
advance, the innermost stable circular orbit (ISCO), and the spin precessions are of 
great interest. For not entering into complicated expressions within higher order PN 
settings, only spinless particles will be considered. For the case of spinning bodies, 
the reader may consult the Refs. [51-54]. 



7.1 Orbital Motion 

The derivation of orbital observables is most elegant with starting from the action 
functional of the particles. The radial action, denoted by ir{E, j) with E reduced 
energy and j reduced angular momentum, is defined by 



i7(0 = + ^IPN + ^2PN + ^3PN + ^4PN 




+ ^2.5Pn(0 + ^3.5Pn(0 




(150) 




(151) 



where the integration has to be performed from mininal radial separation to minimal 
radial separation {pr = 0), notice I r^. In reduced variables, the full 

revolution of phase, O, is given by 
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and the orbital period, P, results in the form 



d 

= —ir(EJ), 



IttGM dE 

see, e.g., [55]. 

Through 3PN order, the periastron advance k turns out to be. 



,2 j2 



+ 



1 



1 

1 + ^ 



1 



h7-2u)^ + U5-2u)E 
4 7 ^ 2 



+a2(i^)^ +a3(i^) 



and the orbital period results in, 
P 1 



2itGM {-2E)^I^ _ 



1-41(15-^)^ 

4 



1 

+ “T 



1 

+ 



■3 {-2Ef!^ 3 o 

(5 _ 2v)- ^ :?t( 35 + 30z/ + l>v^) E^ 

.2 7 32 

(2i(y) 3 ( 23 (z/) : 1 “ < 24 (z/) E^ 






J 



with 



a\{u) 



2 V 2 V 64 3/ 4/’ 

/41 2 218\ 



ai{v) = ^r- + ( — 7T — I U + —V^ 



a^iv) = -(5 -5v + Av ^) , 

a4(j/) = —(21 - 105 j/ + \5u^ + 5u^). 
128 



The orbital motion takes the form. 



r = a^Cl — ercosu ) , 

2tt 

— (t — to) = u — etsinu + Ey^uiv — u) + Tasini; + ... , 

27T 

— (0 - (po) = V + GiiSinu + G2i,sin(2u) + G3i,sin(3i;) + ..., 



(153) 



(154) 



(155) 

(156) 

(157) 

(158) 

(159) 



(160) 
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where the true anomaly v is parametrized in terms of the eccentric anomaly u in the 
form 



More details can be found in [36, 55-58]. 



7.2 Innermost Stable Circular Orbit 

In the Einstein theory binary systems show the phenomenon of innermost stable 
circular orbits (ISCO) beyond which no stable motion is possible. For a test body 
orbiting a Schwarzschild black hole of mass M the ISCO is located at 6GM/c^, 
in Schwarzschild coordinates, or in terms of orbital angular frequency, uo, which 
not depends on the chosen coordinate system, with x = {GMuS)^!'^ I , yielding 
X = 1/6. For binary systems an exact result is not known. Clearly, the ISCO is only 
connected with the conservative part of the dynamics, the dissipative part makes the 
orbits shrinking, [59]. 

In reduced variables the (reduced) Hamiltonian may read if = if (p, r). Circular 
motion holds if Pr = (p • r) / r = 0 is valid. Then the Hamiltonian takes the form 
if = if (/, r), where j is the reduced angular momentum, + Circular 

motion takes place if ^H(j,r) = 0 holds. Then the Hamiltonian is a function of 
j only, if = if (j). The orbital angular frequency uj results in the form uj = 
which can be inverted to obtain H = The ISCO results from the condition 

= 0. This is a fully coordinate-invariant approach. Another possibility to 

o2 

obtain the ISCO is to impose ^H(j, r) = 0. In the following we will best keep to 
the coordinate-invariant definition. 

In case of a Schwarzschild black hole, the reduced Hamiltonian reads, H = 



In terms of our previously defined orbital variables, the orbital angular frequency 
for circular orbits reads. 



V = 2arctan 



1 e(h u 
^tan- 



1-e^ 2 



c^£(x). 




(161) 



27t( 1 -h k) 

UJ = CJradial H” ^periastron = ~ 



( 162 ) 
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Use of the most advanced knowledge, i.e. 4PN approximation, 



2 . 1 . 1 . 1 . 1 . 
c S(x) = Hn ^^[IPN] + ^^[2PN] + “5^[3PN] + -g^[4PN], 



results in 






27 19 

— — — u -|- — 

16 16 48 



1 i\ 
— V I . 

^8 / 



34445 2057T 

+ 



1152 



) 



155 



35 



+ 



/3969 / 123671 

^ V 11520 ~ 
/ 498449 31577t2\ 

V 6912 1152 ) 



192 
90377t2 



V -|- V “h 

192 10368 



224 \ 

( 27 ^ + ln(16x)) ]u 

3072 15 ^ 7 



Z.2- 



301 

3^' 






77 

62208* 






(163) 



(164) 



where 7 ^ = 0.577... denotes the Euler constant. Both the 7 ^-term and the log- 
term originate from the interaction of the binary with its near-zone tail field. The 
formula Eq. (164) as such is the state of the affairs in 2013, [60, 61], i.e. the full 4PN 
information as given in [23] was not needed. The ISCO is obtained by applying the 
condition = 0 (minimum for £ as function of orbital frequency). Eor z/ = 0 , 
the ISCO reads v = 0. 179... which is about 7.7 % larger than the exact value of 1/6; 
for u = 1/4, the equal mass case, one finds x = 0.236.... (more information is given 
in [60]). 
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Spin and Quadrupole Contributions 
to the Motion of Astrophysical Binaries 



Jan Steinhoff 



Abstract Compact objects in general relativity approximately move along geodes- 
ics of spacetime. It is shown that the corrections to geodesic motion due to spin 
(dipole), quadrupole, and higher multipoles can be modeled by an extension of the 
point mass action. The quadrupole contributions are discussed in detail for astrophys- 
ical objects like neutron stars or black holes. Implications for binaries are analyzed 
for a small mass ratio situation. There quadrupole effects can encode information 
about the internal structure of the compact object, e.g., in principle they allow a dis- 
tinction between black holes and neutron stars, and also different equations of state 
for the latter. Furthermore, a connection between the relativistic oscillation modes 
of the object and a dynamical quadrupole evolution is established. 



1 Introduction 



The problem of motion is among of the most fundamental ones in general relativity. 
As a part of the present proceedings on “Equations of Motion in Relativistic Gravity” 
this does probably not require any explanations. The problem is addressed using 
multipolar approximation schemes, the most prominent are due to Mathisson [1,2] 
and Dixon [3], and another one is due to Papapetrou [4]. These particular methods 
have in common that equations of motion for extended bodies are derived from 
the conservation of energy-momentum. In the present contribution, the focus lies 
on theoretical models for compact stars and black holes based on point-particle 
actions. There equations of motions follow from a variational principle instead of 
conservation of energy-momentum. These point-particle actions were probably first 
discussed in general relativity by Westpfahl [5] for the case of a pole-dipole particle 
and later generalized by Bailey and Israel [6] to generic multipoles. 
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However, without further justification, it is not obvious how a point-particle action 
relates to an actual extended body. Most important is the effacing principle [7], which 
indicates that a nonrotating star can be represented by a point mass up to a high order 
within the post-Newtonian approximation. (More details on the use of point-masses 
for self-gravitating bodies within this approximation can be found in other contribu- 
tions to these proceedings, see, e.g., the contribution by G. Schafer) This suggests 
that extensions of the point-mass action can serve as models for extended bodies, 
even in the self-gravitating case. A similar conclusion arises from the framework of 
effective field theory applied to gravitating compact bodies [8]. Indeed, the effective 
action belonging to a compact body naturally takes on the form of a point-particle 
action, which puts previous works on similar actions [5, 6] into a different light. This 
provides enough motivation for us to further elaborate the action approach of [6] in 
Sect. 2, where it is combined with useful aspects of more recent literature [9-13]. An 
application to the post-Newtonian approximation of self-gravitating extended bodies 
is omitted, because various formalisms exist for it and the aim is to highlight aspects 
that are independent of (and hopefully useful for) all of them. 

It is worth mentioning that the effective field theory framework offers a machin- 
ery which can be used, at least in principle, to compute the effective point-particle 
action from a complete microphysical description of the extended body. In practice, 
however, this procedure is not viable for realistic as trophy sical objects and one must 
be satisfied with a more phenomenological construction of the effective action. This 
is in fact analogous to other situations in physics. For instance, it is usually admitted 
that thermodynamic potentials can be derived from a microscopic description. Yet an 
explicit calculation is often too complicated, or the microscopic description is even 
unclear. But a phenomenological construction of thermodynamic potentials or equa- 
tions of state is usually possible. This analogy is further elaborated in Sect. 5. There 
an adiabatic quadrupole deformation due to spin [14] is discussed. An application 
to a binary system in the extreme mass ratio case is given. Quadrupole deformation 
due to an external gravitational fields is discussed in Sects. 4, 5 and 6, both in an 
adiabatic [15-17] and a dynamical situation [18, 19]. 

A main critique against point-particles arises from the fact that Dirac delta distri- 
butions are ill-defined sources for the nonlinear Einstein equations. But the situation 
changes once one softens the Dirac delta using regularization techniques. It is then 
possible to solve the field equations iteratively within some approximation, like the 
post Newtonian one. If one regards the chosen regularization prescription as a part of 
the phenomenological model, then point-particles must be accepted as viable sources 
in general relativity (at least for applications within approximation schemes). This 
point is further stressed in Sect. 6.4. It is important that a weak field approxima- 
tion for the point-particle mimics the field of the actual self-gravitating extended 
body away from the source. (This is precisely the criterion for the phenomenolog- 
ical construction of the effective point-particle source.) Hence, though one applies 
the effective source to weak field approximations, e.g., to compute predictions for a 
binary, strong-field effects from the interior of the bodies are taken into account. 
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The signature of spacetime is taken to be +2. Units are such that the speed of 
light c is equal to one. The gravitational constant is denoted by G. We are going 
to utilize three different frames, denoted by different indices. Greek indices refer to 
the coordinate frame, lower case Latin indices from the beginning of the alphabet 
belong to a local Lorentz frame, and upper case Latin indices from the beginning 
of the alphabet denote the so called body-fixed Lorentz frame. Round and square 
brackets are used for index symmetrization and antisymmetrization, respectively, 
e.g., = ^(A^^ -h A^^). The convention for the Riemann tensor is 

(i) 



2 Point-Particle Actions 



Action principles for spinning point particles have a long tradition, see, e.g., [5, 6, 
9-13, 20-24]. In this section, the advantages from several of these references are 
brought together. Our approach is most similar to [6]. Compared to the presentation 
in [11], a simpler (manifestly covariant) variation technique is applied and the tran- 
sition to tetrad gravity is discussed at a later stage. This makes the derivation more 
transparent. 



2.1 Manifestly Covariant Variation 



Before we start to formulate the action principle, let us introduce a useful notation 
due to DeWitt [25], see also [12, Appendix A]. One can define a linear operator ^ 
such that the covariant derivative Vq, and the Lie derivative read 

Vc. := da + := - {d,e)G\. (2) 

For instance, G^ ^ operates on a tensor Ta^ as G^ ^Ta^ := -h 6^Ta^ . That 

is, is a linear operator that acts on the spacetime indices of a tensor. Notice that 
G^ n does not act on indices of the body-fixed frame. Further, G^^ obeys a product 
rule like a differential operator. Similarly, we can construct a covariant differential 
D and a covariant variation A of quantities defined along a worldline by 

Z) := J + A := + F^^«(^z^)G%. (3) 

For scalars the contributions from the connection vanish. Notice that a variation of 
the worldline is not manifestly covariant if the component values of tensors 
defined on the worldline are held fixed. The variation A instead parallel transports 
to the varied worldline. When it is applied to a tensor taken at the worldline. 
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e.g., Ta^{z), then the variation 5 splits into a part due to the shift of the worldline 
5z^ and a part coming from the variation of the field itself. Let us denote the latter 
part by := {5TJ){z), SO we have 

(5 = (5, + {5zP)dp, A = (5, + {5z^)Va, (for fields). (4) 

For instance, the metric compatibility of Vq, then leads to 

^9 IIP — (^) 



2.2 Action Principle 

We envisage an action principle localized on a worldline z^(A). Here A is an arbitrary 
parameter, not necessarily identical to the proper time r . (Let us require that the action 
is invariant under reparametrizations of the worldline.) We further assume that the 
action is varied with respect to a “body-fixed” frame defined by Lorentz-orthonormal 
basis vectors A^^(A) labeled by A, 



^ VAB, Aa^Ab^'v^'^ ^ ( 6 ) 

Now, stars are in general differentially rotating and it is difficult to interpret a body- 
fixed frame. Such a frame is thus rather an abstract element of our theoretical model, 
inspired by the Newtonian theory of rigid bodies (see, e.g., [11, Sect. 3.1.1]). 

The constraint (6) implies that Aa^ and g^jj are in general not independent and 
one should take special care when both are varied at the same time. In order to address 
this problem, we split the variation AAa^ as 

= A^I^AA^"^] + IaCA^/'Aa") = A©^"^ - (7) 

where we used = —g^^g^^Sga /3 and (5). In the last step, we also introduced 
the abbreviation 



aA[maAa^^ (8) 

which is similar to the antisymmetric variation symbol used in [22], see also [13, 
Eq. (2.7)]. The independence of A©^^ from the metric variation d^gag will be made 
more manifest in Sect. 2.4. For now let us just appeal to the fact that the 6 degrees 
of freedom of the antisymmetric symbol A©^^ exactly matches the degrees of free- 
dom of a Lorentz frame (3 boosts and 3 rotations). Thus A©^^ corresponds to the 
independent variation of the body-fixed Lorentz frame. 
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Let us consider an action that is as generic as possible, 



W = Wf + Wm. WFlgFu....^ = i^ j d^x^R + ..., (9) 

WM{9Fi^,zP,hA^,...] = j dXLM(9FU,ul^,AA'^,Q'^\4>,), (10) 



Here 0/ collectively denotes other dependencies of the Lagrangian Lm and the dots 
denote other fields, like the electromagnetic one. (In this section / is a multi-index 
that may comprise any sort of spacetime, Lorentz, or label indices.) Notice that 
fields like are taken at the worldline position in Wm^ The 4-velocity and 
the angular velocity are defined by 



d\ d\ 



( 11 ) 



Notice that is antisymmetric due to (6) and Dg^y/dX = 0. 



2.3 Variation 



For the sake of deriving equations of motion, we may assume 5\ = 0. Then the 
variation can be commuted with ordinary or partial A-derivatives. Furthermore, the 
Lagrangian Lm is a scalar and we can make use oidLu = AL^to write its variation 
in a manifestly covariant manner, 

5Lm = ( 12 ) 

2 ^ DKa!^ dg^u d(f>] 



where we have defined the linear momentum and spin = —Sj^^ as generalized 
momenta belonging to the velocities and ^2^^, 



_ 8Lm 
■“ 1^' 






^ 9Lm 



(13) 



It should be noted that (12) can be checked using a usual variation 5 together with the 
identity (21), but here it is a simple consequence of the chain rule for A. Obviously 
this method nicely organizes the Christoff el symbols. 

The 5 individual terms in (12) are transformed as follows: 

• The 1st term of (12) is evaluated with the help of 



Au^ = Su^ -h 



D5z^^ 



dX 



(14) 
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• The 2nd term of (12) requires the most work. In order to evaluate we need 

to commute A with the co variant differential D contained in ^2^^, Eq. (11). The 
definitions in (3) lead to 

[A, D] = [(4r^«) - (15) 



Notice the analogy to the commutator of covariant derivatives, which also gives 
rise to curvature. It is useful to derive intermediate commutators first, for instance 



Next, we express (5^ T^^yQ, in (15) with the help of 



(16) 



(17) 



Now it is straightforward to evaluate A^2^^. In the result, we replace A Aa^ using 
(7), make use of 



^^z9po 

d\ 






(18) 



and finally arrive at 



clX 



(19) 



• Before proceeding to the 3rd term of (12), let us recall the transformation property 
of a tensor under an infinitesimal coordinate transformation 

0// - <pi = e.g., -u^ = (20) 



The Lagrangian is a scalar and thus invariant, but it depends on tensors which 
transform. As djj^^ is quite arbitrary, the invariance of the Lagrangian L m leads 
to the identity 



+ 



8Lm 

8Aa^ 



Aa^ 



^8Lm ^ 

9 pa 

89va 



, 9Lm ^ 



i4>i = 0 . 



( 21 ) 



We eliminate the partial derivative of Lm with respect to Aa^ using this relation 
and we replace AAa^ using (7) to arrive at 
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OLm 

dA^ 



= 



1 









OLm 

d(pi 



^ 9Lm 

dg^iy. 






+ 






_ C O Oi 






3Lm 

d(j)i 






( 22 ) 



• In the 4th term of (12) we use (5). 

• The 5th term of (12) is not touched for now, as this requires a specialization of 0/ . 
This is discussed in the Sect. 3.1. 

All these transformations are now applied to (12). Furthermore, we insert a unity in 
the form of 



1 = y* J(4), J(4) := - Z^), 



(23) 



into the terms containing field variations of type 6 ^ . This allows one to rewrite these 
variations at the spacetime point x^ and perform partial integrations. Notice that ^( 4 ) 
has compact support for finite A-intervals, so these partial integrations do not require 
assumptions on field variations at the spatial boundary. Finally, (12) turns into 



6Lm 



-I 



d^x 



pf^u‘'5(4) + (G'“^</./)^5(4) - V„(r''M^(5(4)) 



I A J, I 

+ — A(/)/ + 



d4>i 



d(pi 

Pn^v iG nii(j>j) 



Sgnuix) 



IJ,1P 



+ 









DPji 

dX 



C I, ^ 

* +5 a 



8Lm 1 DSi 
d(pi ~ 2~d\ 

1 






(24) 






Notice that the covariant and ordinary derivatives with respect to A are identical for 
the last term. 



2.4 Metric Versus Tetrad Gravity 



The separation of metric and body- fixed-frame variations by means of (7) is an elegant 
trick to derive equations of motion. However, if further calculations at the level of 
the action are performed, one often needs an explicit split between gravitational and 
body-fixed-frame degrees of freedom. This can be achieved by introducing a tetrad 
gravitational field 6a^{x), that is, a field of Lorentz-orthonormal basis vectors labeled 
by a and defined at every spacetime point 

^ Pab, ^ g^’^ ■ 



(25) 
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Tetrad gravity replaces the metric by virtue of the latter relation and regards 6a^ as 
the fundamental gravitational field in the variation principle. This allows us to split 
Aa^ as 



= AA^Ca^iz), 



(26) 



where Aa^ is now just a usual (fiat- spacetime) Lorentz matrix 

AA^AB^’llab = VAB, . (27) 

Thus is independent of the gravitational field and the announced manifest 
split is indeed given by (26). 

Based on this split, we can understand the meaning of in more detail. As 

Aa^ is a usual Lorentz matrix, we can follow Ref. [22] and describe its indepen- 
dent variations by an antisymmetric symbol := A^^5Aa^ . Then A©^^ reads 
explicitly 

(28) 

One can be even more explicit and write 60^^ as a linear combination of six indepen- 
dent variations of angle variables parameterizing A^^, see [22, Sect. 3. A]. Anyway, 
A©^^ is in fact a linear combination of the independent frame variations 50^^ with 
other variations. Now, it is legitimate to regard A©^^, 6z^, and 5ea^ as independent 
variations instead of 50^^ , Sz^, and Sea^. This just corresponds to a linear recom- 
bining of the equations of motion following from the variation. Equation (28) shows 
that this recombination manifestly removes noncovariant terms related to the 
variation and an antisymmetric part of the energy-momentum tensor due to 
(the symmetric part arises from as usual). All of this is important 

for the next section, where equations of motion are deduced from (24). 

In the next step it is possible to return to metric gravity by a partial gauge fixing of 
the tetrad. For instance, a possible gauge condition is to require that the matrix (^a^) 
is symmetric (in spite of the different nature of its indices). Then ea^ is given by the 
matrix square-root of the metric. This gauge choice leads to the same conclusions 
as in [9, Sect.IV.B], where a more direct construction was followed. In the end, the 
partially gauge-fixed tetrad is a function of the metric, so we have obtained a metric 
gravity theory. It might look like the introduction of a tetrad field accompanied by an 
enlarged gauge group of gravity is just extra baggage. However, more gauge freedom 
is important for applications, as gauges can and should be adopted to the problem at 
hand. For instance, for an ADM-like canonical formulation of spinning particles, it 
is a wise choice to adopt the Schwinger time-gauge for the tetrad [10]. Further, some 
subtle aspects of the consistency of the theory can be analyzed more easily within 
tetrad gravity (e.g., the algebra of gravitational constraints, because after reduction 
to metric gravity the gravitational field momentum receives complicated corrections 
[10]). Spinning particles should always be coupled to tetrad gravity in the first place. 
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3 Equations of Motion 

In order to draw conclusions from (24), one must further specialize the so far arbitrary 
0/. The assumptions we are going to introduce in the following are not the least 
restrictive, but already allow important insights on the structure of the equations of 
motion. 



3.1 Further Assumptions 



Let US assume from now on that the 0/ can be split into two groups. We denote by 
0field contains spacetime fields (functions of x), so its variation 

can be evaluated using (4). The second group (j)^^ contains variables defined on the 
worldline only (functions of A) and its first order derivatives where ‘ := D/dX. 
Most importantly, we assume that the correspond to independent variational 
degrees of freedom, like Lagrange multipliers or the dynamical multipoles introduced 
in Sect. 6. Without loss of generality, one can then assume that the carry indices 



[ 1^1 



= 0 . 



of the body-fixed frame instead of spacetime indices, so that 

Notice that our assumptions do not allow time (i.e.. A) derivatives of and 
as part of the If such accelerations would appear in subleading contributions 
of the Lagrangian (within some approximation scheme), then one can often remove 
them by a redefinition of variables [26] . Further, acceleration-dependent Lagrangians 
are often problematic due to Ostrogradsky instability. For these reasons, we also 
assumed that at most first-order time derivatives of the appear in Lm • However, 
our assumptions here are not entirely exhaustive. For instance, a concrete situation 
for which our assumptions should be relaxed in the future is discussed at the end of 
Sect. 5.1. 



3.2 Equations of Motion for Linear Momentum and Spin 



With these assumptions, we have 



dL 



M 



d(j)i 



A0/ 



dL 



M 



+ 



^field 

~8L 



M 



>'1 






wl 



d\ 



e / wl I ^ 






(29) 



where we used that the worldline variables do not carry spacetime indices and we 
introduced their canonical generalized momenta. 
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The second line leads to the usual Euler-Lagrange equations for the worldline degrees 
of freedom which are discussed in Sect. 6.4. Let us focus on the other terms for 
now. Using the arbitrariness and independence of Sz^ and A©^^, we can read off 
the equations of motion for the linear momentum and the spin from (24) and (29), 

*1)^, (31) 

(32) 

The total A-derivative in the last line of (24) was ignored here. (Here we assume that 
the variation vanishes at the end points of the worldline). The energy-momentum 
tensor density is simply given by the coefficient in front of Sgf^i^ll in 

(24). However, an explicit determination requires yet another specialization of 
because the fields can dependent on the metric. In the absence of one imme- 
diately recovers the result of Tulczyjew [27]. 



DPii 

dX 



DS^ 



'flU 



dX 



3.3 Quadrupole 



Let us now explore the case that H is useful to introduce an 

abbreviation for the corresponding partial derivative of the Lagrangian, 






8Lm 

8R^i>a(3 



(33) 



The conventional factor of —6 is motivated by comparing (31), now reading 



^ P!,UP - ^-VpRupPaJ'''^^^, (34) 

with the corresponding result of Dixon at the quadrupolar approximation level. An 
identification of with Dixon’s reduced quadrupole moment is tempting, as this 
makes (34) formally identical to Dixon’s result. It is important that inherits 

the symmetries of the Riemann tensor. Lrom these symmetries and the properties of 
the operator we obtain 

J-P»pGPuRapa^ = -ArP^HPR.po,^ = -AJPP^^R.po,^. (35) 



This simplifies (32) to 






( 36 ) 
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and formally agrees with Dixon’s spin equation of motion, too. Finally, the energy- 
momentum tensor agrees with the explicit result in [28] (in the present conventions, 
see (5.3) in [11]). This derives from 



(37) 



which must be further expanded using (17) and then leads to 



8Lm 

8Rjjiva(5 



SzR 










2 

3 






2 ■ 



(38) 



This is the contribution coming from the first term in (29). Another contribution 
arises from the second term in the first row of (24), which is evaluated using (35). 
Collecting all terms in front of in (24), we can read off the energy momentum 
tensor density as 



= J 



dX 









(39) 



3.4 Other Multipoles 



Dixon’s moments are essentially defined as integrals over the energy-momentum ten- 
sor of the extended body. Though these definitions can be applied to self-gravitating 
bodies, the derivation of the equations of motions based on these definitions only 
succeeds for test-bodies [3]. It was shown in [29] (see also the corresponding contri- 
bution by A. Harte in these proceedings) using methods for self-force calculations 
that for self-gravitating objects the equations of motions are still of the same form, 
but the multipole moments must be renormalized. The multipoles arising from the 
effective action should therefore be related to these renormalized moments. For self- 
gravitating bodies, one can not in general calculate the moments in the equations of 
motion using Dixon’s integral formulas any more. In the language of effective field 
theory, the multipoles are calculated through a “matching” procedure instead, which 
will be explained in Sect. 5. 

Other gravitational multipoles can be incorporated by including symmetrized 
covariant derivatives of the curvature in Similarly, electromagnetic multipoles 
arise from an analogous construction based on the Faraday tensor F^jj. A quite 
exhaustive case is therefore 

0 / = {Riiua(3'> ^ pRiiua(3-> ^{a'^p)Rfiiyag^ • • • ? pRp)iy^ • • •}• 

(40) 
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Notice that the commutation of covariant derivatives results in curvature terms 
and that, e.g., = 2V(^aF/^)iy — 2V(^aFu)fi, which can be checked using 

= 0. Again the partial derivatives of Lm with respect to the can 
be called multipole moments. However, these multipoles and also are probably 
not unique, because Lm in not unique. For instance, contractions of covariant deriv- 
atives with can be written as A-derivatives and one can partially integrate them. 
Notice that Dixon’s multipole moments have the same symmetries as ours, but satisfy 
additional orthogonality relations to a timelike vector defined on the worldline. 

It is also possible to include a term proportional to in the Lagrangian, as 
this combination transforms into a total A-derivative under a gauge transformation 
of the electromagnetic potential A^^. It just leads to the well-known Lorentz force. 
However, in the present approach a part of the Lorentz force is hidden in the definition 
of Pn, making the equations of motion not manifestly gauge-invariant. 



4 Symmetries, Transformations, and Conditions 

In this section we discuss symmetries, conservations laws, various transformations 
of the action, and conditions it must fulfill. 



4.1 Symmetries and Conserved Quantities 

Action principles have the advantage that one can easily derive conserved quantities 
from the Noether theorem [30]. Here we are going to consider only symmetry trans- 
formations where the fields are not transformed. Further, we assume (5 A = 0, so the 
variational formula (24) together with (29) is still valid. 

On the one hand, we require that the Lagrangian transforms under such a symmetry 
into a total derivative 



5Lm = — , (41) 

dX 

without making use of the equations of motion. On the other hand, if we assume that 
the equations of motion hold, then only the total time derivatives from (24) with (29) 
inserted contribute to 6 Lm- These total derivatives are located in the last lines of (24) 
and (29). (The first lines of (24) and (29) vanishes because fields are not transformed 
here.) We therefore have the conservation law 



d 

dX 






wl 

I 




= 0 . 



(42) 



A simple example is given by the global symmetry under a change of the body- 
fixed frame. In order to make things even more simple, we assume that L m does not 
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depend on Aa^ and on the so that = 0. But Lm still implicitly depends on 
Aa^ through ^2^^. A constant infinitesimal Lorentz transformation of the body-fixed 
frame then reads 



5z‘' = 0 , 



(43) 



where is a constant infinitesimal antisymmetric matrix. Obviously is invari- 
ant under this transformation, so (41) is fulfilled with K = 0. Further, we have 
— Aa^ Ab^oo^^ and (42) reads 



2 d\ 



S^vAa^ Ai 



(44) 



As is arbitrary, we see that the components of the spin in the body-fixed frame 
Sab = S^j^Aa^Ab^ are constant. A corollary of this fact is that the spin length S is 
constant, where 2S'^ = SabS^^ = 

The next important example is a symmetry of the spacetime described by a Killing 
vector field = 0. (Notice that also C^Rabcd = 0 etc.) Other fields entering 

Lm are assumed to be invariant under this symmetry, too, e.g., = 0. We 

consider an infinitesimal shift of the worldline coordinate 



= e^, AAa" = -cC^Aa'' = cAa^V^C", = 0, (45) 

where e is an infinitesimal constant and we assume parallel transport of Aa^ along 
i.e., ^^VyAA^ = 0. Notice that the fields are not transformed, but their symmetry 
along is important. Recall that —eC^ generates an infinitesimal coordinate trans- 
formation. Therefore, is invariant under this transformation if all the variables it 
depends on, including the fields, would be transformed by — But the shift (45) 
only applies to Aa^, and so the result of (45) on Lm is exactly opposite to 
the case when all variables except A a^, and are transformed. These variables 
are all the fields, so the (5 Lm produced by (45) can be obtained by transforming all 
the fields using +eC^. But the fields were assumed to be invariant under this trans- 
formation. Hence we have argued that (45) is a symmetry of the action, 6Lm = 0, 
and K = 0. Combining (42) and (45), we find the conserved quantity 

:= = const, (46) 

where A©^^ = was used. It is interesting that this covers to all multipole 

orders. This was also shown in [31, p. 210] based on the equations of motion. 

A special kind of conserved quantities that is not covered by the Noether theorem 
here are mass-like quantities. We will see later on that masses enter the action as 
parameters and are therefore constant by assumption. 
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4.2 Legendre Transformations 



Before proceeding, it is worth to point out that of course not every conceivable 
Lagrangian Lm is acceptable. Some choices are mathematically inconsistent or phys- 
ically unacceptable for other reasons. Some Lagrangians Lm are technically more 
difficult to handle and it makes sense to assume simplifying conditions for Lm for 
a first study. One such assumption we make here is that the relation between spin 
and angular velocity is a bijection. Notice that this relation is fixed by Lm through 
our definition of in (13). A violation of our assumption can have the interesting 
implication that the spin supplementary condition follows from (13), see [22], but we 
will not considering this scenario here. The supplementary conditions are discussed 
in the next section. 

With this assumption on the relation between spin and angular velocity, we can 
solve for in terms of (and probably other variables). This allows a Legendre 
transformation in ^2^^, i.e.. 



V---] = 






(47) 



It is important that the spin is varied independently now. Notice that this notion of 
Legendre transformation is unusual in mechanics, as is not a time derivative, 
but a combination of time derivatives. Still Legendre transformations are applicable 
in much more generic situations, which is heavily used, i.e., in thermodynamics. 
The function Rm establishes the connection to Routhian approaches [32-34]. The 
Routhian is a mixture of a Hamiltonian and a Lagrangian. Here it is essentially the 
sum of Rm and the connection part in Notice that therefore the Routhian 

is not manifestly covariant and covariance only becomes apparent at the level of the 
equations of motion. In contrast, in our construction Rm is manifestly covariant. 

A consequence of reparametrization invariance is that Lm must be a homoge- 
neous function of degree one in all (first-order) A-derivatives. For our assumptions in 
Sect. 3.1 this applies only to ^2^^, and so Eulers theorem on homogeneous 
functions reads 



Lm = 



^Lm 






U1.M 





















This is a consequence of reparametrization-gauge invariance, so in this sense it is 
analogous to (21), which follows from coordinate-gauge invariance. Let us proceed 
with a Legendre transformation in This is more subtle, as the relation between 
and p/j, can not be a bijection. To see this, first notice that (48) can be interpreted 
as a constraint on the component of p^. This can be formulated as the famous 
mass- shell constraint 



+ M^ = 0 , 



(49) 
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where A4 is called dynamical mass and usually depends on the dynamical variables. 
Thus the momentum only has three independent components. On the other hand, 
has four independent components: three physical and one gauge degree of freedom 
due to reparametrization invariance in A. (If we would choose A to be the proper time, 
then just 3 components are independent. But the constraint = —1 makes the 
variational principle more subtle.) That is, the constraint (49) produces a mismatch in 
degrees of freedom between and so they can not be connected by a bijection. 

However, the Legendre transformation can in fact be generalized to the case where 
constraints appear. One can perform the Legendre transformation “as usual” if all 
constraints are added to the action using Lagrange multipliers [35, 36]. Here we 
need one Lagrange multiplier a for (49), which together with the three independent 
components of provides a total of four independent variables. This exactly matches 

the four independent degrees of freedom of u^. The Lagrange multiplier a isolates 
the reparametrization-gauge degree of freedom, while represents the physical 
degrees of freedom. Again we require Lm to be such that no pathologies for this 
“constraint” Legendre transformation arise. 

Finally, the result of the transformation is 



z'’, Pn, a, Sut,, Aa^, ■ ■ •] 

= [ dX 






(50) 



where 



M = M Pn, Aa^, Shi,, • (51) 

We assume that we can also Legendre transform in the 0^’ without giving rise to 
further constraints or pathologies. From the variations of p^, S^u, and we obtain 



= ap ^ + 



adM^ 

2 dpn 



= a 



dM^ 






a dM^ 
2 



(52) 



These are just the inverses to variable transformations used in the Legendre trans- 
formations. Because we did not touch the variables Aa^ and it is 

clear that 



8Lm _ 


adM^ 


8Lm _ 


a dM^ 


dg^y 


2 8gfjiy 


9Aa^ “ 


2 3Aa^’ 


8Lm _ 


a dM^ 


3Lm _ 


a dM^ 


d4>f ~ 


2 d4>f ’ 


“ 


2 Id 



(53) 



( 54 ) 
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The Lagrange multiplier a is determined by choosing a normalization for which 
corresponds to a gauge choice for A. For a given dynamical mass function M, one 
can then evaluate the equations of motion (31) and (32). 

Coming back to the plan outlined in the introduction, we have the option to 
construct either Lm, or in a phenomenological manner. Let us explore the 
last option here, e.g., because it promotes both and to dynamical variables, 
which are probably easier to identify in realistic situations compared to and 
. Further, it is suggestive that the mass M of the object as a function of the 
dynamical variables completely determines the macroscopic dynamics of the body. 
This situation is analogous to a thermodynamic potential (like the internal energy) 
describing the large-scale behavior of a thermodynamic system. This is the first 
indication that thinking in terms of thermodynamic analogies is very useful here. 



4.3 Supplementary Conditions 

The model for spinning bodies developed up to now comprises too many degrees 
of freedom. We expect three rotational degrees of freedom instead of six provided 
by the Lorentz frame Aa^. Similarly, the spin should only have three independent 
components, too. It is suggestive to impose that the time direction of the body-fixed 
frame is aligned to a (to be defined) rest frame described by a unit time-like vector 
r^, and that the spin only has spatial components in this rest frame, 

Ao^ = = 0. (55) 

One can also envision different time-like vectors in each of these conditions. How- 
ever, using the same vector seems to fit well to the interpretation of as a rest frame. 
The condition on the spin is usually called spin supplementary condition. Two spe- 
cific options are j ^ —UyU^ or — p^ j ^ —pvP^ . The latter condition is 

usually considered as the best choice, as it uniquely fixes the representative worldline 
of the extended object [37-39] (if Dixon’s definitions for the multipoles are used). A 
more detailed discussion of supplementary conditions is given in the contributions 
by D. Giulini, L.F. Costa and J. Natario. But notice that in fiat space the choice of 
this condition can be related to the choice of the representative worldline for the 
extended body. In curved spacetime this relation could not be established yet. From 
a careful perspective one should therefore reckon that different spin supplementary 
conditions may lead to inequivalent models. As long as the relation to the choice 
of center is not clarified for curved spacetimes, one must regard this condition as a 
constitutive relation of the model. For this reason, one should also avoid conditions 
which are not manifestly covariant. 

The most straightforward way to implement (55) into a given action is to add these 
conditions using Lagrange multipliers. In general, this will modify the dynamics by 
constraint forces. As in classical mechanics, one requires that (55) is preserved in 
time, which should fix the Lagrange multipliers. This can lead to inconsistencies. 
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in which case one should revise the action or the choice for It can also lead 
to further constraints, which we regard as unphysical here as they further reduces 
the number of independent variables (we want exactly three rotational degrees of 
freedom). Similarly, if some of the Lagrange multipliers remain undetermined, then 
the degrees of freedom are increased, which we also regard as unphysical. The last 
possibility is that the Lagrange multipliers are uniquely fixed by requiring that (55) 
is preserved. In the end, we can insert this solution for the Lagrange multipliers 
into the action. In this way we obtain an action without Lagrange multipliers which 
preserves (55). 



4.4 Conditions on the Dynamical Mass 



Having this said, we can try to directly construct an action which preserves (55). 
This approach is in fact very natural here. For instance, one can make an ansatz for 
and use this requirement to fix some of the coefficients. The first condition in 
(55) can be written as tjoa = and is preserved in time if 

0 = — + where (56) 

ClA oS^i/ 



Using (21) and (53), we can write the spin equation of motion (32) in the form 









OAa'^ 



(57) 



With the help of this equation, we see that the spin supplementary condition is 
preserved in time if it holds (56) and additionally 



0 = 



dM^ 

dA^ 






(58) 



This condition is often trivially fulfilled, namely when does not explicitly depend 
on Aa^. We are going to construct a simple example now, in order to show that 
functions Ai exist which are consistent with all of our requirements. 



4.5 A Simple Construction of the Dynamical Mass 

Instead of constructing an action which fulfills (56) and (58) for a specific choice of 
r^, one can look at a specific action and construct a such that the requirements (56) 
and (58) are fulfilled. Let us consider a simple example where is a nonconstant 
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analytic function / depending only on S'^ := i.e., = f(S^). It is 

clear that (58) is fulfilled, because there is no explicit dependence on the body-fixed 
frame. We still need to satisfy (56), which reads explicitly 



0 = 



Dr^^ 

~dX 



+ afS^^ry 



Dri^ 

~dX' 



(59) 



That is, the vector must be parallel transported along the worldline. This does 
indeed characterize a suitable spin supplementary condition, which was first dis- 
cussed in [40, Sect. 3.4]. Although, with this condition, lacks an immediate inter- 
pretation as a rest frame, the numerical results in [40] show that it leads to similar 
predictions as the choice / \J—PvP^ • Further discussions on this supple- 

mentary condition are given in other contributions to these proceedings. 

For the case of a black hole, the laws of black hole dynamics [41, Box 33.4] 
suggests that 



>IL = /(x") = »S+(2^- <“> 

where mo is the constant irreducible mass related to the horizon area. We can now 
have a look at the angular velocity with respect to asymptotic time, so we have 
OL — (for a body at rest). Evaluating (52), we find agreement with what is 

usually identified as the angular velocity of the horizon. This is a nice check for 
the consistency of the interpretation of our variables. Notice that the laws of black 
hole dynamics owe their name to their similarity to the laws of thermodynamics. 
Further, an action principle similar to the one presented here can be used to derive 
the so called first law of black hole binary dynamics [13]. Again we encounter the 
thermodynamic character of the approach. 

For objects other than black holes, we can derive M from the moment of inertia. 
One usually defines the moment of inertia / (5^) as the proportionality factor between 
spin and angular velocity, which can be read off from (52). Again we have a = 
M~^ = so (52) leads to the differential equation/"^ = /“^/^/Mts solution 

reads 



Af = f(S^) = 



mo + 




-|2 

dx 

21 (x) 



( 61 ) 



where the irreducible mass mo enters as an integration constant. For neutron stars, 
the function I (S^) can be obtained numerically, e.g., using the RNS code [42, 43]. 
Alternatively, one can numerically compute the gravitating mass M directly as a 
function of S for a fixed number of baryons in the star. It would be interesting to see 
if both methods lead to compatible results. It should be noted that both black holes 
and neutron stars posses a quadrupole (and other multipoles) when they are spinning, 
which was neglected here. It will be included in the next section. 
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Interestingly, it is implied by [44] that for the pole-dipole case one can construct a 
such that (56) and (58) are fulfilled for j ^—pvP^ without the need for 

approximations or truncations of Then, however, is not solely dependent 
on S. The details on this are left for a future work. 



5 Spin-Induced Quadrupole 

In this section, we are going to develop a simple phenomenological model for 
describing the spin-induced quadrupole of a star. This is the quadrupole of a star 
arising from a deformation away from spherical symmetry due to rotation. We start 
with a reasonable ansatz for The main idea for this ansatz is to include all 
possible covariant (general coordinate invariant) terms up to a certain power in spin 
and curvature. The unknown coefficients in this ansatz are then fixed by comparing 
to the Kerr metric and to numerical solutions for the gravitational field of a rotating 
neutron star. One should emphasize that a truncation of requires negligibly small 
interaction energies, not small multipoles. 



5.1 Construction of the Action 

We are going to include in our ansatz the quadratic order in spin and second order 
derivatives of the metric. This means that we include terms linear in the curvature and 
covariant derivatives of the curvature are not allowed. This implies that we exclude 
A-derivatives of the curvature for now, but this restriction will be loosened below. 
Symbolically we have which according to Sect. 3.4 implies that 

we neglect interaction terms involving octupole and higher multipoles. Finally, let 
us assume the absence of further worldline degrees of freedom in this section, or 
_ 0 ^ gQ fQj. ^ dependence of on Aa^. [Then (58) is already 

fulfilled.] 

The main task is to collect all possible interaction terms. One must take care of 
including only independent terms, which can by tricky due to the symmetries of 
the Riemann tensor. A procedure for this was applied to the construction of effec- 
tive Lagrangians or Routhians in [ 8 , 34, 45]. Instead, we are going to construct 
directly, but the arguments are essentially the same. We will follow a different 
approach to implement the spin supplementary condition, too, by making an ansatz 
for around the case 




-\- O(Rniyap) • 



(62) 



As a first simplification, one can replace Rni^a(3 by its tracefree version, the Weyl 
tensor The traces are given by := g^^R^au(3 and R := which 
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are related to the energy momentum tensor T through Einstein’s gravitational field 
equations 



= SttG . (63) 

The energy momentum tensor can contain contributions from fields penetrating the 
compact object, like electromagnetic or dark matter fields. We assume that these can 
be neglected, i.e., the bodies are mainly interacting via the gravitational field. But 
in the case of self-gravitating bodies, the energy momentum tensor also includes 
a singular contribution from the point-particle (39) itself. Let us assume that these 
singular self-interactions can be dropped. Then we can effectively make use of the 
vacuum field equations = 0 at the particle location, so we have Rfjiya(3 = 
However, in general one is not allowed to use field equations at the level of the action. 
But in the current context this is essentially a valid procedure, as it is equivalent to 
a field redefinition in the action, see [8, 26], or [46, Appendix A]. Without loss of 
generality, we can therefore restrict to = {C^ya(3) in the quadrupole case. 

The most important and most obvious requirement on the allowed interaction 
terms in Af ^ is general coordinate invariance. Further restrictions on the terms and 
transformations identifying equivalent terms (equivalent within our truncation) are: 

1. In four spacetime dimensions, the Weyl tensor can be split into an electric 
and a magnetic part with respect to a time-like unit vector. Choosing this 
vector to be r^, it holds 

Efiu = Cf,au/3r°'r'^, = ^ri^apaCui3'’'^r°'rl^ , (64) 

where r]^jya^ is the volume form. These tensors have the properties 

= E,^, E^,g^^ = 0, E^,r^ = 0, (65) 

= B^^, = 0, B^^r^ = 0. (66) 

These properties make E^j^ and B^j^ much easier to handle compared to 

2. We include only terms invariant under parity transformations. In this respect it is 

important to notice that B^y is of odd parity. We conclude that any terms with an 
odd number of magnetic Weyl tensors must also include an odd number of volume 
forms g/j,yag- Due to the antisymmetry of it will always be contracted with 

both indices of the spin at the current level of truncation. Then we can rewrite all 
terms involving in terms of the dual of the spin tensor 

^^a/3 •— 2^^ 



(67) 
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Notice that we have the identity [22, Eq. (A.7)] 

= ( 68 ) 

As a consequence, it holds = 0. It is customary to define a spin 

vector := rjy 

3. It should be noticed that one is in general not allowed to neglect terms involving 

the combination though these numerically vanish due to the spin sup- 

plementary condition (55): A variation of these terms can lead to nonvanishing 
contributions to the equations of motion. Instead, terms in the action which are at 
least quadratic in can be neglected, as their contributions to the equations 
of motion are at least linear in the spin supplementary condition and thus always 
vanish. 

4. The quadrupole interaction terms can be simplified using the leading order trun- 
cation of the mass-shell constraint = 0. (For the ansatz in (70) 

given below, this implies that we can set p^p^ ~ —p^m the higher order terms 
of M}.) This transformation does in fact just correspond to a redefinition of the 
Lagrange multiplicator ct, and the idea is therefore similar to the field redefinitions 
mentioned above [26] . 

5. Time derivatives of p^ and can also be removed by redefinitions of variables, 
which follows from the ideas in [26] and is again analogous to the mentioned 
field redefinitions. Besides that, the absence of higher order time derivatives was 
already assumed in Sect. 3.1. 

The last point also shows that our ansatz will automatically cover time derivatives 
of and of arbitrary order. As we work at linear level in the curvature, the time 
derivatives can always be removed from the curvature through partial integration. 
After this transformation, all time derivatives finally apply to p^ and S^jj only, which 
can be removed by virtue of the argument 5 above. This suggests that these terms 
belong to the quadrupole level, too, although time derivatives of the fields are in 
fact covariant derivatives = This is of course related to the ambiguity 

of the multipoles pointed out in Sect. 3.4. At linear level in the curvature, one can 
assume that the covariant derivatives are projected orthogonal to r^, because ~ 
= Z)/JA up to higher order terms, which can be partially integrated. 

The first point mentioned above suggests to include just and in 
However, this is currently not possible, because we assumed in Sect. 3.1 that the 
0field contain just fields, but r^^ in (64) is only defined on the worldline. For instance, 
one would have to clarify the meaning of arising from V^E^jj in (31). For 
simplicity, let us stick to here, but have in mind that depends 

on only through the combinations and B^j^. The equations of motion are 

initially expressed in terms the quadrupole moment related to 



( 69 ) 
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see (33), but these are at once related to the moments belonging to and 
through the chain rule. The interpretation of the latter moments as quadrupoles is 
much more obvious than for (69), as and are symmetric tracefree spatial 
tensors in the rest frame defined by These moments can be called electric and 
magnetic quadrupoles, respectively. They match the quadrupole degrees of freedom 
of the gravitational field outside the body [47], in contrast to (33), which in general is 
not tracefree. This approach to define electric and magnetic quadrupoles was briefly 
discussed in [1 1]. An explicit split into electric and magnetic quadrupoles at the level 
of the equations of motion was performed in [48]. 



5.2 Ansatz 



The most general ansatz for A4^ now reads 

M^ = M^ + + 0{E'^, b\ S^), (70) 

where we introduced the abbreviation We assume that ji, and 

are constants. Remember that within the curvature terms we can set ~ 
which is due to (62) and point 4 of the last section. Notice that /i must be a 
function of the constant spin length, /i^ = cf. (61). Otherwise the Legendre 

transformation would be problematic. Consistent with our truncation, we may write 

= ml + '^S^ + 0(S*), (71) 

where 1q = 1 (0) is the moment of inertia in a slow rotation limit 5^0. Furthermore, 
the constants C^^ip and will in general depend on fi and S. This is further 
discussed below. 

Next, we want to check if (56) is fulfilled. Notice that (56) is required to hold at 
linear order in spin only. For this purpose, let us make an ansatz for to linear order 

in S, 

= Crppf^ + GCrBsB^'^S^ + 0{E^, B^, S^). (72) 



The normalization = —1 leads to Cpp = 1. Inspecting (56), we see that most of 
the contributions from the Cr 5 5 -term are shifted to higher orders, namely quadratic 
level in spin: This is due to ^2 = 0(S) and Sjy = O(S^). The only C^^^-term linear 
in spin contains B^^. Though this is a derivative of the curvature, it is not of higher 
order, because we realized that our ansatz effectively also covers A-derivatives of the 
curvature. We conclude that CpBS = 0^ or ~ p^^. For calculating in (56), it is 
useful to rewrite (31) as 




— Bo,[pri/]S ) 



2K + 



dM^ ' 
dpu . 




dM^ 

d^' 



dX 



(73) 
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Fig. 1 The coefficient as a function of the dimensionless spin a = S/Gfj?, where /i is 
identified with the gravitating mass and is given by /i = 1.4Mq here. The data points were 
generated using the RNS code [42, 43], where a multipole extraction according to [49] was used. 
The labels SLy, APR, FPS, and AU refer to equations of state considered in [50] 



Finally, the condition (56) is fulfilled to linear order in spin if C^^ip = 2 in our 
ansatz (70). The condition (58) is of course also fulfilled. In summary, we must have 

Crp = 1 , CrBS — 0 ? ^BS^p ~ (^ 4 ) 

while is not determined by basic principles, but depends on the specific object. 
Instead of fixing algebraically like in (72), it would be interesting to view (56) as 
an evolution equation for in the future, analogous to (59) in the pole-dipole case. 

In Fig. 1 the numerical value of C ^^2 is shown as a function of the spin length for 
fixed mass fi but different neutron star models. It is apparent from the plot that Cp ^2 
is approximately independent of the spin length. However, one should be careful and 
check this assumption for the specific case of interest. This determination of is 
actually a simple example of a matching procedure. The quadrupole moment J of 
the effective point particle is parametrized through the ansatz (70) as / €^^ 28 ^. 

This is compared (or matched) to the quadrupole moment of a numeric neutron star 
spacetime computed with the RNS code [42, 43]. Here the quadrupole moment is 
identified through the exterior spacetime. This means that the effective point particle 
mimics the exterior spacetime of a numerically constructed neutron star model, which 
depends crucially on strong field effects in the interior. This makes an interesting 

indicator for both the neutron star equation of state and strong-field modifications of 
gravity. For black holes, a comparison with the Kerr metric leads to C ^^2 = 1. 

Finally, we come back to the thermodynamic analogy to our approach. The quadru- 
pole relation J ~ C ^^2 S'^ can be viewed as a simple (idealized) “equation of state” 
relating the macroscopic variables J and 5. As in the case of the ideal gas, this model 
can be improved to meet the required accuracy. This can be done systematically here 
by extending the ansatz (70) to higher orders. 
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5.3 Application 



As an application for the spin-induced quadrupole constructed in the last section, we 
consider the case of a test particle moving in a Kerr spacetime. This test particle can 
be characterized as a pole-dipole-quadrupole particle. We aim at an estimate for the 
relevance of the spin- squared contributions, so we may consider a specific orbital 
configuration that simplifies the discussion. This is obviously a circular orbit in the 
equatorial plane of the Kerr geometry. Let us further assume that the spin of the test 
body is aligned with the rotation axis of the background spacetime. 

In the absence of a quadrupole, these orbits can be constructed in a simple manner, 
which was first used in [51]. This method is in fact still applicable for the considered 
quadrupole model [52]. It requires that conserved quantities, spin supplementary 
condition, and constraints on the orbital configuration are enough to uniquely fix the 
10 dynamic variables contained in and . This is just an algebraic calculation, 
in contrast to solving the differential equations of motion. A numeric study for 
Schwarzschild spacetime is given in [53]. 

The spin supplementary condition pjj = 0) contains three independent equa- 
tions. The constraint on the orbit provides three further independent conditions: one 
due to equatorial orbits {p^ = 0) and two due to spin alignment = 0). So we 
need to identify 10 — 3 — 3 = 4 conserved quantities in order to solve for p^ and 
algebraically. Three conserved quantities were already identified in Sect. 4.1. 

These are the spin-length S := and the quantities derived from the two 

Killing vectors of Kerr spacetime {dt and de) through (46). Well call the latter two 
the energy E := Eq^ and total angular momentum := E-q^ of the particle. The 
last remaining conserved quantity is just the mass-like parameter //, which in the 
action approach is constant by assumption. However, one should remember that (70) 
is truncated and thus only approximately valid. One can equivalently say that p is 
only conserved approximately, corresponding to the truncation of (70). This point of 
view was taken in [52]. 

Now we are in a position to solve for pa and Most important is the equation 
for p^ . After some algebra [52], one finds that (p^)^ is given by a polynomial of 
second order in E. We denote the roots of this polynomial by t/+ and t/_, i.e., 

{p'-f oc (£ - U+){E - U-). (75) 

For p^ to be a real number, we need to have both E < U-\- and E < f/_, or both 
£’>!/+ and E > f/_. It turns out that the important relation is just £" > for the 
most relevant part of the parameter space. This justifies to call t/+ effective potential: 
The test body can only move in the region where E >Uj^ and its turning points are 
given by £■ = f/+, because then p^ =0 (which implies = 0, see [52]). Therefore 
the minimum of as a function of r defines circular orbits. This completes our 
construction. 

The various contributions to the dimensionless binding energy e := E j p — 1 
are plotted in Fig. 2 for the case of a very rapidly rotating (small) black hole in a 
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Fig. 2 Various corrections to the binding energy e for a maximally spinning (small) black hole in 
a Schwarzschild background. Here — S' is the orbital angular momentum. The mass ratio 

is formally taken to be ^ = 1 in the plot, though the result are only valid for ^ 1 . The curves 

can be scaled to the case of interest {q < 10“^): self-force and linear spin effects scale as a the 
others as a 



Schwarzschild background. A comparison with recent results for the conservative 
part of the self-force 154] is also included. In a Kerr background, the last stable 
circular orbit can be very close to the horizon, so that the discussed effects can be 
some orders of magnitude stronger. The reader is referred to 152] for a more complete 
discussion. 



6 Dynamical Quadrupole and Tidal Forces 



For the model developed in the last section, the quadrupole adiabatically follows the 
spin evolution. Thus, the quadrupole is not an independent dynamical variable. In 
this section, we are going to investigate dynamical quadrupoles, but restrict to the 
nonspinning case for simplicity. 



6.1 Basic Idea 

We have already discovered that the dynamical mass M plays a role similar to a 
thermodynamic potential. From this perspective, one can compare the variables it 
depends on, like and 5^^^, to thermodynamic state variables. Noticing that and 
are the monopole and dipole moment, a natural extension is to introduce dynam- 
ical “state” variables for other multipoles, too. A possible motivation arises from the 
realization that stars have oscillation modes and that these modes can be excited by 
tidal forces from an external time-dependent gravitational field. This phenomenon is 
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well understood in Newtonian gravity [55], see also [56-58] and references therein. 
If one wants to capture it by our approach, one obviously must introduce dynamical 
worldline variables corresponding to these oscillation modes. Suitable point-particle 
actions were already discussed in [45, 59], though with applications to absorption 
or binary systems in mind. 

The key to find a model for dynamical multipoles is to understand the reaction of 
the multipoles to external fields. We focus here on the response of the quadrupole 
to external tidal fields. In fact, we will encode the quadrupole dynamics in terms 
of a response function. This function can equivalently be called the propagator of 
the quadrupole [45], which better highlights the fact that it is a necessary ingredient 
for deriving predictions using perturbative calculations, e.g., in the post-Newtonian 
approximation. A third possible naming is correlation function between quadrupole 
and external field. This better accentuates the parallels to statistical mechanics or 
thermodynamics. The idea is that if one would be able to model the correlations of 
the most important multipoles among each other and with external fields, then one 
can in principle predict the motion of extended objects (with complicated internal 
structure) to any desired precision. 

It is important to notice that the multipole moments of a compact object can 
be defined through their exterior field. The response functions of the multipoles to 
externally applied tidal fields can therefore be obtained by analyzing the gravitational 
field outside of the body. The final goal is to extract these functions from numerical 
simulations of a single compact object. However, for a first simpler investigation one 
can restrict to linear perturbations of nonrotating compact objects. The unperturbed 
metric in the exterior is then just the Schwarzschild one. Because this metric is static 
and spherically symmetric, its linear perturbations can then be decomposed into 
Fourier basis in the time direction and spherical harmonic basis (0, (j)) in angular 
directions. Then their radial dependence is described by the famous Zerilli [60] or 
Regge-Wheeler [61] equations for electric- or magnetic-parity-type perturbations, 
respectively. The Zerilli equation can be transformed into the simpler Regge-Wheeler 
form [62], so we can focus just on the latter one. It reads 



where u is the frequency of the perturbation, I is the angular momentum quantum 
number, r is radial coordinate in the Regge-Wheeler gauge, Rs is the Schwarzschild 
radius (representing the mass of the body), r* = r Rs log(r/Rs — 1) is the tortoise 
radial coordinate, and X denotes the Regge-Wheeler master function. Given some 
boundary values for X at the surface of the body (which result from a solution to the 
more complicated interior perturbation equations), it is straightforward to integrate 
this equation numerically. The question is how one can decompose X into external 
(applied) tidal field and multipolar field generated by the body in response to the 
external field. This is a complicated problem in the general relativistic case. Let us 
therefore start with the Newtonian theory in order to get a better understanding of 
the problem [19]. 




(76) 
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6.2 Newtonian Case 



The Newtonian case can be obtained as a weak field and slow motion approxima- 
tion of general relativity. That is, we have to set = 0 (weak field) and uo = 0 
(slow motion) in (76). The perturbation of the Newtonian potential Opert can be 
reconstructed as 



Im 



d /(/ + 1) 

h ^ ^ 

dr 2r 



Xln 



(77) 



where the Ximu are solutions to the Newtonian limit of (76) for all values of the 
parameters /, m, and uo. 

The generic solution to the Newtonian limit of (76) reads 



X = Cir'+i +C2r“', 



(78) 



where Ci and C 2 are integration constants. The part diverges asymptotically, 
which means that its source is located at infinity. Therefore, C\ is the strength of the 
external field. Similarly, the r~^ part is singular at the origin and emanates from the 
compact body, so C 2 describes the /-polar field of the body. The frequency-domain 
response Fi of the multipoles to external fields is then proportional to the ratio of C 2 
and Cl. In the conventions used in [18, 19], it holds 



Fiiuj) = 



lil - 1) 

G(/-hl)(/ + 2)(2/ 



C 2 

l)!!Ci 



(79) 



This response must in general be computed numerically. The first step is to numer- 
ically solve the interior problem of a perturbed body, including the interior grav- 
itational field perturbation. Then the gravitational field is matched to (78) at the 
surface, which leads to numeric values for the integration constants and thus for the 
response (79). This response can in general acquire a complicated frequency depen- 
dence through the internal dynamics. Usually one defines normal oscillation modes 
by requiring that the body keeps up a multipolar field without external excitation, 
i.e., for Cl =0. Therefore the response (79) has a pole at normal mode frequencies. 

In the case of linear perturbations of a nonrotating barotropic star, the response 
turns out to be quite simple. For the quadrupolar case I = 2, the outcome is shown 
in Fig. 3. In fact, the form of the response can even be computed analytically and 
reads [19] 



n 



^nl 



^nl 



(80) 



This is just the sum of response functions of harmonic oscillators with resonance fre- 
quencies (poles) at uoni • Here n labels the type and overtone number of the oscillation 
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Fig. 3 Response function of 
the quadmpole, / = 2, for a 
one solar mass star. The 
equation of state is a 
polytrope with index 1 and 
such that the radius R is 
17.7km in the Newtonian 
case or 15.7km in the 
relativistic case 
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modes. The constants Ini are the so called overlap integrals, which here simply take 
the role of coupling constants between the oscillators and the external driving forces. 
As a consequence, the internal dynamics can be captured by an effective action 
through just a set of harmonic oscillators, which are coupled to the tidal force of the 
gravitational field [19] (with coupling constants Ini). By fitting the numeric result 
for Fi to (80), one can extract the constants uonl and Ini. 

It is worth to point out that the presented Newtonian setup is simple enough to 
perform explicitly the effective field theory procedure of integrating out small scales, 
see [ 19] . This turns a compact fluid configuration into a point particle on macroscopic 
scales. 



6.3 Relativistic Case at Zero Frequency 

Let us now return to the relativistic case, but restrict to even parity and the adiabatic 
case uo = 0. The connection between the relativistic tidal constants defined in [15-17, 
63] and the response function is given by a Taylor-expansion, 

— — = /i/ + iXiuj + + 0(uj^), (81) 

see [19]. Here the constants /// are named after the astronomer A.E.H. Love, who 
introduced them for tidal effects in the Earth-Moon system. A dimensionless version 
of the Love numbers /i/ is often defined as 

(21 - 1 )!! 

where R is the radius of the star. The A/ -term in (81) is related to absorption [45] and 
/i 2 was introduced in [63]. 
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Fig. 4 Dimensionless 
quadrupolar Love number ^2 
as a function of the 
compactness c = Rs /2R for 
two different equations of 
state (SLy, FPS) 




It remains to define how the response should be computed in the adiabatic rel- 
ativistic case. First, we again solve (76), this time for cj = 0, and find an analytic 
result in terms of the Gauss hypergeometric function 2 F 1 , 

X = Cir^+1 iFii-l -2,2-1, -21; Rs/r) 

+ Cir-' 2 Fi{l - U + 3, 2(1 + 1): Rs/r), (83) 

see, e.g., [64]. Again we can obtain numeric values for the integration constants by 
solving the perturbation equations inside the body and then match the gravitational 
field to (83) at the surface. In the limit of 1/r ^ 0 the hypergeometric functions 
are equal to 1, so (83) turns into (78). This implies that the interpretation of the 
integration constants as magnitudes of external field and response is still valid. The 
even-parity response in the adiabatic case uj = 0 then follows from (79) as before. A 
plot of the outcome in terms of the dimensionless Love number k 2 is given in Fig. 4. 
An extension of the application from Sect. 5.3 to adiabatic tidal deformations can be 
found in [52]. 

For integer values of /, the hypergeometric functions in (83) turn into polynomials 
(which possibly contain logarithms). Then one might worry that the exponents on r 
from the two independent solutions in (83) can overlap and spoil an unique identifi- 
cation of external field and response. However, this is avoided by examining X for 
generic values of /, in the sense of an analytic continuation. This is in spirit similar 
to working in generic dimension, as done in [64] . 



6.4 Relativistic Case for Generic Frequency 

We now turn our attention to the case of generic frequency in the even parity sector 
[18]. One can still solve (76) analytically [65], this time in terms of a series involving 
hypergeometric functions. We write the generic solution schematically as 
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X = AiZ',st + A2^mst'^ (84) 

where we denote the solution from [65] by a subscript MST. 

Note here that ^mst ^mst^ ^ r~\ which means that (79) essentially 

still works. Of course, one has to take into account the normalization of the Xmst 
in order to rewrite the C/ in (79) in terms of the A/. This introduces complicated 
cj-dependent corrections into (79). These are computed through a matching of the 
asymptotic field of the extended body to the field of the point-particle model. The 
details of this procedure can be found in [18]. The basic steps are as follows: 

• The field of the effective theory is obtained from an inhomogeneous version of (76) 
with a point particle source. It is understood that the post-Minkowskian approxi- 
mation is applied, as this removes the singular point of (76) at the Schwarzschild 
radius. The explicit form of the source term derives from (39). 

• The solution to the inhomogeneous equation is constructed from the homogeneous 
solution (84) using the method of variation of parameters. This method involves 
integrals over products of singular source and the Xmst- The integration constants 
just represent a generic solution to the homogeneous solution that can always be 
added. 

• Here the integration constants must be restricted further. Due to the singular behav- 
ior of the differential equation at r = 0, the homogeneous solution might actually 
not be homogeneous at r = 0. But the externally applied field is homogeneous 
everywhere, including r = 0. The restriction of the integration constants is there- 
fore equivalent to the identification of the external part of the field and the part 
generated by the particle. 

• Notice that an /-pole source involves I partial derivatives of a delta distribution. 

This suggests to identify the self-field by r~^ and the external field by 

^MST dimensional reasons. Here the idea of analytic continuation in I is 

again crucial. 

• The integrals arising in the variation of parameters are actually singular. This is not 
surprising, as the self-field of point-particles always leads to this kind of problem. 
A regularization method must be introduced. 

These steps lead to a refined (frequency dependent) version of (79) expressing the 
response function in terms of A\ and A 2 . The final step is again to obtain numeric 
values for A\ and A 2 for an actual (extended) neutron star. 

The result for the general relativistic response function is shown in Fig. 3. It 
can still be fitted by (80) very well. This implies that the internal dynamics can be 
approximated by a set of harmonic oscillators. Restricting to the quadrupolar level 
/ = 2 for simplicity, this translates to a dynamical mass of the form 

sa ^^2 -I- ^ ^ + 2InE^^ (f)nAB'^ , (85) 

n 



where the internal dynamical variables (pnAB and only have spatial components 
in the body-fixed frame (0^05 = 0 = V^n^^) and are symmetric tracefree in the 
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indices A and B. The dynamical equations for the quadrupolar worldline variables 
can be extracted from (52), (29), and (54), 



a dM^ j a dM^ 



( 86 ) 



In the linear perturbation regime, the contributions of the internal dynamical variables 
are small compared to /i^. The index n still labels the type of the oscillation mode. 
The mass quadrupole is the coefficient in front of i.e., Qab •= Sn 
Now the frame enters through so we need to check if (58) is 

fulfilled. Using Aq^ = and = 0 it is easy to see that this is the case. In 

fact, (58) is always fulfilled if the time direction of the body-fixed frame Aq^ drops 
out of the action. 

Some final remarks on the problem of regularization of point particles are in order. 
It was shown already in [59] that the quadrupole diverges at order in dimensional 
regularization. It is therefore not surprising that poles appear in the generalization 
of (79) at order which must be subtracted within some renormalization scheme. 
At the same time, the poles give rise to an explicit appearance of a renormalization 
scale parameter, which in a sense parametrizes the ambiguity in the choice of the 
renormalization scheme. An important point is that this scale parameter is in fact 
fixed by the requirement that the response function has an asymptotic behavior for 
uj ^ OQ compatible with (80). Different regularization and renormalization schemes 
will in general lead to slightly different numeric values for this scale parameter. 
However, within a given scheme its value can be uniquely matched and is therefore 
not ambiguous. In this sense, the regularization and renormalization scheme is a part 
of the phenomenological model. 



7 Conclusions 

We considered point-particle models for extended bodies in gravity, in particular for 
black holes and neutron stars. The multipoles of the point particles are adjusted such 
that their field predicted from a weak field approximation matches an exact/numerical 
solution for the extended object in question. This incorporates strong field effects 
from the interior of the extended object in the model. This is of particular impor- 
tance when binary systems are considered using weak field approximations, e.g., for 
gravitational wave source modeling or pulsar timing. 

Therefore, point-particle actions are far more powerful than what was probably 
envisioned when they were first investigated [5,6]. The resulting equations of motion 
are similar to Dixon’s results. Here we developed astrophysical realistic models 
for the multipoles in these equations. The latest development is the inclusion of 
oscillation modes in relativistic tidal interaction of neutron stars. 
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An interesting topic not discussed here are universal relations for various neutron 
star properties. Here “universal” refers to an approximate independence among var- 
ious proposed realistic equations of state. In [66, 67] universal relations between the 
dimensionless moment of inertia I the quadrupolar Love number fi^/G^ii^ , 
and the quadrupole constant were found and coined I-Love-Q relations. Fur- 
ther investigations, also including higher multipoles, followed shortly afterwards 
[68-73]. This indicates that coefficients in (70) arising at higher orders are actually 
not independent, but are (approximately) fixed by universality. (For black holes, in 
fact all coefficients are fixed, which is guaranteed by the no hair theorem.) This 
makes the expansion (70) a meaningful tool to study the impact of the equation of 
state on observations, as predictions of the effective model are then parametrized by 
only a small set of constants. 

The most interesting development for the future is probably the description of 
oscillation modes for rotating bodies, which can be tried in a slow rotation approxima- 
tion. It is also interesting to investigate if universal properties hold for the ingredients 
of the response function, e.g., for the overlap integrals. 
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Testing the Motion of Strongly 
Self-Gravitating Bodies with Radio Pulsars 



Norbert Wex 



Abstract Before the 1970s, precision tests for gravity theories were constrained to 
the weak-field environment of the Solar System. In terms of relativistic equations 
of motion, the Solar System gave access to the first order corrections to Newtonian 
dynamics. Testing anything beyond the first post-Newtonian contributions was for 
a long time out of reach. The discovery of the first binary pulsar by Russell Hulse 
and Joseph Taylor in the summer of 1974 initiated a completely new field for testing 
the relativistic dynamics of gravitationally interacting bodies. For the first time the 
back reaction of gravitational wave emission on the binary motion could be stud- 
ied. Furthermore, the Hulse-Taylor pulsar provided the first test bed for the orbital 
dynamics of strongly self-gravitating bodies. To date, there are a number of binary 
pulsars known which can be utilized to test different aspects of relativistic dynamics. 
So far GR has passed these tests with flying colors, while many alternative theories, 
like scalar-tensor gravity, are tightly constraint by now. This article gives an intro- 
duction to gravity tests with pulsars, and summarizes some of the most important 
results. Furthermore, it gives a brief outlook into the future of this exciting field of 
experimental gravity. 



1 Introduction 



In about two years from now we will be celebrating the centenary of Einstein’s 
general theory of relativity. On November 25th 1915 Einstein presented his field 
equations of gravitation (without cosmological term) to the Prussian Academy of 
Science [1]. With this publication, general relativity (GR) was finally completed as 
a logically consistent physical theory ( ''Damit ist endlich die allgemeine Relativitdt- 
stheorie als logisches Gebdude abgeschlossen. ”). Already one week before, based 
on the vacuum form of his field equations, Einstein was able to show that his the- 
ory of gravitation naturally explains the anomalous perihelion advance of the planet 
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Mercury [2] . While in hindsight this can be seen as the first experimental test for GR, 
back in 1915 astronomers were still searching for a Newtonian explanation 13]. In 
his 1916 comprehensive summary of GR 14], Einstein proposed three experimental 
tests: 

• Gravitational redshift. 

• Light deflection. 

• Perihelion precession of planetary orbits. 

Gravitational redshift, a consequence of the equivalence principle, is common to all 
metric theories of gravity, and therefore in some respect its measurement has less 
discriminating power than the other two tests 15] . The first verification of gravitational 
light bending during the total eclipse on May 29th 1919 was far from being a high 
precision test, but clearly decided in favor of GR, against the Newtonian prediction, 
which is only half the GR value 16]. In the meantime this test has been greatly 
improved, in the optical with the astrometric satellite HIPPARCOS 17], and in the 
radio with very long baseline interferometry 18, 9]. The deflection predicted by GR 
has been verified with a precision of 1 .5 x 10“"^. An even better test for the curvature 
of spacetime in the vicinity of the Sun is based on the Shapiro delay, the so-called 
“fourth test of GR” 110]. A measurement of the frequency shift of radio signals 
exchanged with the Cassini spacecraft lead to a 10“^ confirmation of GR 111]. Apart 
from the four “classical” tests, GR has passed many other tests in the Solar system 
with flying colors: Lunar Laser Ranging tests for the strong equivalence principle and 
the de-Sitter precession of the Moon’s orbit 112], the Gravity Probe B experiment for 
the relativistic spin precession of a gyroscope (geodetic and frame dragging) 113], 
and the Lense-Thirring effect in satellite orbits 114], just to name a few. 

GR, being a theory where fields travel with finite speed, predicts the existence of 
gravitational waves that propagate with the speed of light 115] and extract energy 
from (non-axisymmetric) material systems with accelerated masses 116]. This is also 
true for a self-gravitating system, where the acceleration of the masses is driven by 
gravity itself, a question which was settled in a fully satisfactory manner only several 
decades after Einstein’s pioneering papers (see 117] for an excellent review). This 
fundamental property of GR could not be tested in the slow-motion environment 
of the Solar system, and the verification of the existence of gravitational waves 
had to wait until the discovery of the first binary pulsar in 1974 118]. Also, all the 
experiments in the Solar system can only test the weak-field aspects of gravity. The 
spacetime of the Solar system is close to Minkowski space everywhere: To first 
order (in standard coordinates) the spatial components of the spacetime metric can 
be written as gij = (1 — 2^/ c^)Sij, where O denotes the Newtonian gravitational 
potential. At the surface of the Sun one finds ~ —2 x 10“^, while at the 

surface of a neutron star ~ —0.2. Consequently, gravity experiments with 

binary pulsars, not only yielded the first tests of the radiative properties of gravity, 
they also took our gravity tests into a new regime of gravity. 

To categorize gravity tests with pulsars and to put them into context with other 
gravity tests it is useful to introduce the following four gravity regimes: 
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G1 




G2 




G3 




GW 



Fig. 1 Illustration of the different gravity regimes used in this article 



G1 Quasi-stationary weak-field regime: The motion of the masses is slow compared 
to the speed of light {v c) and spacetime is only very weakly curved, i.e. close 
to Minkowski spacetime everywhere. This is, for instance, the case in the Solar 
system. 

G2 Quasi-stationary strong-field regime: The motion of the masses is slow com- 
pared to the speed of light (i; c), but one or more bodies of the system are 

strongly self-gravitating, i.e. spacetime in their vicinity deviates significantly 
from Minkowski space. Prime examples here are binary pulsars, consisting of 
two well- separated neutron stars. 

G3 Highly -dynamical strong-field regime: Masses move at a significant fraction 
of the speed of light (v ^ c) and spacetime is strongly curved and highly 
dynamical in the vicinity of the masses. This is the regime of merging neutron 
stars and black holes. 

GW Radiation regime: Synonym for the collection of the radiative properties of 
gravity, most notably the generation of gravitational waves by material sources, 
the propagation speed of gravitational waves, and their polarization properties. 

Figure 1 illustrates the different regimes. Gravity regime G1 is well tested in the Solar 
system. Binary pulsar experiments are presently our only precision experiments for 
gravity regime G2, and the best tests for the radiative properties of gravity (regime 
GW).^ In the near future, gravitational wave detectors will allow a direct detection 
of gravitational waves (regime GW) and probe the strong and highly dynamical 
spacetime of merging compact objects (regime G3). As we will discuss at the end 
of this article, pulsar timing arrays soon should give us direct access to the nano- 
Hz gravitational wave band and probe the properties of these ultra-low-frequency 
gravitational waves (regime GW). 



1.1 Radio Pulsars and Pulsar Timing 



Radio pulsars, i.e. rotating neutron stars with coherent radio emission along their 
magnetic poles, were discovered in 1967 by Jocelyn Bell and Antony Hewish [20]. 
Seven years later, Russell Hulse and Joseph Taylor discovered the first binary pulsar, 
a pulsar in orbit with a companion star [18]. This discovery marked the beginning of 



^ Gravitational wave damping has also been observed in a double white-dwarf system, which 
has an orbital period of just 13 min [19]. This experiment combines gravity regimes G1 (note, 
r/c ~ 3 X 10-3) and GW of Fig. 1. 
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Fig. 2 The P-P diagram for 
radio pulsars. Binary pulsars 
are indicated by a red circle. 

Pulsars that play a particular 
role in this article are marked 
with a green dot and have 
their name as a label. The 
data are taken from the 
ATNF Pulsar Catalogue [21] 

T3 




gravity tests with radio pulsars. Presently, more than 2000 radio pulsars are known, 
out of which about 1 0 % reside in binary systems [21]. The population of radio pulsars 
can be nicely presented in a diagram that gives the two main characteristics of a 
pulsar: the rotational period P and its temporal change P due to the loss of rotational 
energy (see Fig. 2). Fast rotating pulsars with small P (millisecond pulsars) appear 
to be particularly stable in their rotation. On long time-scales, some of them rival 
the best atomic clocks in terms of stability [22, 23]. This property makes them ideal 
tools for precision astrometry, and hence (most) gravity tests with pulsars are simply 
clock comparison experiments to probe the spacetime of the binary pulsar, where the 
“pulsar clock” is read off by counting the pulses in the pulsar signal (see Fig. 3). As 
a result, a wide range of relativistic effects related to orbital binary dynamics, time 
dilation and delays in the signal propagation can be tested. The technique used is 
the so-called pulsar timing, which basically consists of measuring the exact arrival 
time of pulses at the radio telescope on Earth, and fitting an appropriate timing model 
to these arrival times, to obtain a phase-connected solution. In the phase-connected 
approach lies the true strength of pulsar timing: the timing model has to account for 
every (observed) pulse over a time scale of several years, in some cases even several 
decades. This makes pulsar timing extremely sensitive to even tiny deviations in the 
model parameters, and therefore vastly superior to a simple measurement of Doppler- 
shifts in the pulse period. Table 1 illustrates the current precision capabilities of pulsar 
timing for various experiments, like mass determination, astrometry and gravity tests. 
We will not go into the details of pulsar observations and pulsar timing here, since 
there are numerous excellent reviews on these topics, for instance [24, 25], just to 
mention two. In this article we focus on the relativistic effects that play a role in 
pulsar-timing observations, and how pulsar timing can be used to test gravitational 
phenomena in generic as well as theory -based frameworks. 
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Fig. 3 Spacetime diagram illustration of pulsar timing. Pulsar timing connects the proper time of 
emission Tpsr, defined by the pulsar’s intrinsic rotation, and the proper time of the observer on Earth 
Tobs, measured by the atomic clock at the location of the radio telescope. The timing model, which 
expresses robs as a function of Tpsr, accounts for various “relativistic effects” associated with the 
metric properties of the spacetime, i.e. the world line of the pulsar and the null-geodesic of the 
radio signal. In addition, it contains a number of terms related to the Earth motion and relativistic 
corrections in the Solar system, like time dilation and signal propagation delays (see [26] for details) 



Table 1 Examples of precision measurements using pulsar timing 



Rotational period 


5.757451924362137(2)ms 


[27] 


Orbital period 


0.102251562479(8) d 


[*] 


Small eccentricity 


(3.5 ± 1.1) X 10-’ 


[28] 


Distance 


157(l)pc 


[27] 


Proper motion 


140.915(1) mas yr“^ 


[27] 


Masses of neutron stars 


rup = 1.4398(2) Mo 


[29] 


ruc = 1.3886(2) Mo 


[29] 


Mass of millisecond pulsar 


1.667(7) Mo 


[30] 


Mass of white-dwarf companion 


0.207(2) Mo 


[31] 


Mass of Jupiter and moons 


9.547921(2) x 10“4 Mq 


[32] 


Relativistic periastron advance 


4.226598(5) deg yr-' 


[29] 


Gravitational wave damping 


0.504(3) pico-Hz yr“ ^ 


[*] 


GR validity (observed/GR) 


1.0000(5) 


[*] 



A number in bracket indicates the (one-sigma) uncertainty in the last digit of each value. The symbol 
Mq stands for the Solar mass. (cf. Table 1 in [33]) 

[*] Kramer et al., in prep. 
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1.2 Binary Pulsar Motion in Gravity Theories 



While in Newtonian gravity there is an exact solution to the equations of motion 
of two point masses that interact gravitationally, no such exact analytic solution is 
known in GR. In GR, the two-body problem has to be solved numerically or on 
the basis of approximation methods. A particularly well established and successful 
approximation scheme, to tackle the problem of motion of a system of well- separated 
bodies, is the post-Newtonian approximation, which is based on the weak-field slow- 
motion assumption. However, to describe the motion and gravitational wave emission 
of binary pulsars, there are two main limitations of the post-Newtonian approximation 
that have to be overcome (cf. 134]): 

(A) Near and inside the pulsar (and its companion, if it is also a neutron star) the 
gravitational field is strong and the weak-field assumption no longer holds. 

(B) When it comes to generation of gravitational waves (of wavelength Aqw) and 
their back-reaction on the orbit (of size r and period R^), the post-Newtonian 
approximation is only valid in the near zone (r ^ Agw = cFhjT), and breaks 
down in the radiation zone (r > Agw) where gravitational waves propagate and 
boundary conditions are defined, like the ‘no incoming radiation’ condition. 

The discovery of the Hulse-Taylor pulsar was a particularly strong stimulus for the 
development of consistent approaches to compute the equations of motion for a binary 
system with strongly self-gravitating bodies (gravity regime G2). As a result, by now 
there are fully self-consistent derivations for the gravitational wave emission and the 
damping of the orbit due to gravitational wave back-reaction for such systems. In fact, 
in GR, there are several independent approaches that lead to the same result, giving 
equations of motion for a binary system with non-rotating components that include 
terms up to 3.5 post-Newtonian order (i;^/c^) 135, 36]. For the relative acceleration 
in the center-of-mass frame one finds the general form 



GM 



r = — 



(1 H- A 2 + A 4 H- A 5 H- H- A 7 ) - 

r 

+ (B2 + B4 + B5 + B6 + B-,) r] 



( 1 ) 



where the coefficients and are of order c~^, and are functions of r = |r|, r, 
V = |r|, and the masses (see 135] for explicit expressions). The quantity M denotes 
the total mass of the system. At this level of approximation, these equations of 
motion are also applicable to binaries containing strongly self-gravitating bodies, 
like neutron stars and black holes. This is a consequence of a remarkable property 
of Einstein’s theory of gravity, the ejfacement of the internal structure 134, 37]: In 
GR, strong-field contributions are absorbed into the definition of the body’s mass. 

In GR’s post-Newtonian approximation scheme, gravitational wave damping 
enters for the first time at the 2.5 post-Newtonian level (order v^/c^), as a term 
in the equations of motion that is not invariant against time-reversal. The corre- 
sponding loss of orbital energy is given by the quadrupole formula, derived for the 
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first time by Einstein within the linear approximation, for a material system where 
the gravitational interaction between the masses can be neglected [16]. As it turns 
out, the quadrupole formula is also applicable for gravity regime G2 of Fig. 1, and 
therefore valid for binary pulsars as well (cf. [34]). 

In alternative gravity theories, the gravitational wave back-reaction, generally, 
already enters at the 1.5 post-Newtonian level (order This is the result of the 

emission of dipolar gravitational waves, and adds terms A 3 and to Eq. (1) [5, 38]. 
Furthermore, one does no longer have an effacement of the internal structure of a com- 
pact body, meaning that the orbital dynamics, in addition to the mass, depends on the 
“sensitivity” of the body, a quantity that depends on its structure/compactness. Such 
modifications already enter at the “Newtonian” level, where the usual Newtonian 
gravitational constant G is replaced by a (body-dependent) effective gravitational 
constant Q. For alternative gravity theories, it therefore generally makes an impor- 
tant difference whether the pulsar companion is a compact neutron star or a much 
less compact white dwarf. In sum, alternative theories of gravity generally predict 
deviations from GR in both the quasi- stationary and the radiative properties of binary 
pulsars [39, 40]. 

At the first post-Newtonian level, for fully conservative gravity theories with- 
out preferred location effects, one can construct a generic modified Einstein-lnfeld- 
Hojfmann Lagrangian for a system of two gravitationally interacting masses mp 
(pulsar) and (companion) at relative (coordinate) separation r = |x^ — x^l and 
velocities \p = Xp and = x^ : 



Lo = 



+ ■ 



Qmpinc 



r _ V/7 • Vc _ (r-\p)(r- 

|_ 2c^ 2c^r^ 



(r-\p)(r-\c) . (Vn-Vc) 



2n 






Q^Mmpmc 

2c^r^ 



( 2 ) 



where M = mp -\- Mc. The body-dependent quantities Q, e and ^ account for devia- 
tions from GR associated with the self-energy of the individual masses [5, 39]. In GR 
one simply finds Q = G, e = 2, and ^ = 1. There are various analytical solutions to 
the dynamics of (2). The most widely used in pulsar astronomy is the quasi-Keplerian 
parametrization by Damour and Deruelle [41]. It forms the basis of pulsar- timing 
models for relativistic binary pulsars, as we will discuss in more details in Sect. 1.4. 

Beyond the first post-Newtonian level there is no fully generic framework for the 
gravitational dynamics of a binary system. However, one can find equations of motion 
valid for a general class of gravity theories, like in [42] where a framework based 
on multi-scalar-tensor theories is introduced to discuss tests of relativistic gravity to 
the second post-Newtonian level, or in [43] where the explicit equations of motion 
for non-spinning compact objects to 2.5 post-Newtonian order for a general class of 
scalar-tensor theories of gravity are given. 
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1.3 Gravitational Spin Effects in Binary Pulsars 



In relativistic gravity theories, in general, the proper rotation of the bodies of a 
binary system directly affects their orbital and spin dynamics. Equations of motion 
for spinning bodies in GR have been developed by numerous authors, and in the 
meantime go way beyond the leading order contributions (for reviews and references 
see, e.g., [34, 44-46]). For present day pulsar-timing experiments it is sufficient to 
have a look at the post-Newtonian leading order contributions. There, one finds three 
contributions: the spin-orbit (SO) interaction between the pulsar’s spin and the 
orbital angular momentum L, the SO interaction between the companion’s spin Sc 
and the orbital angular momentum, and finally the spin- spin interaction between the 
spin of the pulsar and the spin of the companion [44] . 

Spin-spin interaction will remain negligible in binary pulsar experiments for the 
foreseeable future. They are many orders of magnitude below the second post- 
Newtonian and spin-orbit effects [47], and many orders of magnitude below the 
measurement precision of present timing experiments. For this reason, we will not 
further discuss spin-spin effects here. 

For a boost invariant gravity theory, the (acceleration-dependent) Fagrangian for 
the spin-orbit interaction has the following general form (summation over spatial 
indices /, 7 ) 



1 ,• T ^ T 4 W 5 • • j j 

+ X - 4) > 

2 t^A ’'aB 

(3) 

where 5^ = is the antisymmetric spin tensor of body A [34, 39, 48]. The 

coupling function can also account for strong-field effects in the spin-orbit 
coupling. In GR = 2G. For bodies with negligible gravitational self-energy, 
one finds in the framework of the parametrized post-Newtonian (PPN) formalism^ 
= (7PPN + 1)G, a quantity that is actually most tightly constrained by the light- 
bending and Shapiro-delay experiments in the Solar system, which test 7 ppn [ 8 , 9, 
11, 49]. 

In binary pulsars, spin-orbit coupling has two effects. On the one hand, it adds 
spin-dependent terms to the equations of motion (1), which cause a Fense-Thirring 
precession of the orbit (for GR see [44, 50]). So far this contribution could not be 
tested in binary pulsar experiments. Prospects of its measurement will be discussed 
in the future outlook in Sect. 5. On the other hand it leads to secular changes in the 
orientation of the spins of the two bodies (geodetic precession), most importantly 
the observed pulsar in a pulsar binary [44, 51, 52]. As we discuss in more details in 
Sect. 3, a change in the rotational axis of the pulsar causes changes in the observed 



I V ^ i 

^SO(X/l, Va,3a) = — 



^The PPN formalism uses 10 parameters to parametrize in a generic way deviations from GR at the 
post-Newtonian level, within the class of metric gravity theories (see [5] for details). 
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emission properties of the pulsar, as the line-of- sight gradually cuts through different 
regions of the magnetosphere. 

As can be derived from (3), to first order in GR the geodetic precession of the 



where = 27 t/R^ and Vb = (GMub)^^^. 

It is expected that in alternative theories relativistic spin precession generally 
depends on self-gravitational effects, meaning, the actual precession may depend on 
the compactness of a self-gravitating body. For the class of theories that lead to the 
Lagrangian (3), Eq. (4) modifies to 



where Vt = (GMub)^^^ is the strong-field generalization of Vb. 

Effects from spin-induced quadrupole moments are negligible as well. For double 
neutron-star systems they are many orders of magnitude below the second post- 
Newtonian and spin-orbit effects, due to the small extension of the bodies [47]. If 
the companion is a more extended star, like a white dwarf or a main-sequence star, 
the rotationally-induced quadrupole moment might become important. For all the 
binary pulsars discussed here, the quadrupole moments of pulsar and companion are 
(currently) negligible. 

Finally, certain gravitational phenomena, not present in GR, can even lead to a 
spin precession of isolated pulsars, for instance, a violation of the local Forentz 
invariance in the gravitational sector, as we will discuss in more details in Sect. 4. 



1.4 Phenomenological Approach to Relativistic Effects 
in Binary Pulsar Observations 

For binary pulsar experiments that test the quasi- stationary strong-field regime (G2) 
and the gravitational wave damping (GW), a phenomenological parametrization, 
the so-called ‘parametrized post-Keplerian’ (PPK) formalism, has been introduced 
by Damour [53] and extended by Damour and Taylor [39]. The PPK formalism 
parametrizes all the observable effects that can be extracted independently from 
binary pulsar timing and pulse- structure data. Consequently, the PPK formalism 
allows to obtain theory-independent information from binary pulsar observations by 
fitting for a set of Keplerian and post-Keplerian parameters. 



pulsar, averaged over one orbit, is given by (L = L/|L|) 




l-e^[^2mp\ m2 c 2 ’ 



( 4 ) 




( 5 ) 
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The description of the orbital motion is based on the quasi- Keplerian parame- 
trization of Damour and Deruelle, which is a solution to the first post-Newtonian 
equations of motion [41, 54]. The corresponding Roemer delay in the arrival time of 
the pulsar signals is 



r 9 9i^/^ 

Ar = X sin ijj [cos U — e{\ -\- (5^)] + x cos uj\ \ — e {\ -\- 5$) sin f/, (6) 

where the eccentric anomaly U is linked to the proper time of the pulsar T via the 
Kepler equation 



The five Keplerian parameters e, x, uj, and Tq denote the orbital period, the 
orbital eccentricity, the projected semi-major axis of the pulsar orbit, the longitude 
of periastron, and the time of periastron passage, respectively. The post-Keplerian 
parameter 5r is not separately measurable, i.e. it can be absorbed into other timing 
parameters, and the post-Keplerian parameter 5e has not been measured up to now 
in any of the binary pulsar systems. The relativistic precession of periastron changes 
the the longitude of periastron uj according to 



uo = uoq-\- uo — arctan 

7T 




( 8 ) 



meaning, that averaged over a full orbit, the location of periastron shifts by an angle 
ujPb. The parameter uj is the corresponding post-Keplerian parameter. A change in 
the orbital period, due to the emission of gravitational waves, is parametrized by the 
post-Keplerian parameter Pb. 

Besides the Roemer delay Ar, there are two purely relativistic effects that play an 
important role in pulsar timing experiments. In an eccentric orbit, one has a changing 
time dilation of the “pulsar clock” due to a variation in the orbital velocity of the 
pulsar and a change of the gravitational redshift caused by the gravitational field of 
the companion. This so-called Einstein delay is a periodic effect, whose amplitude 
is given by the post-Keplerian parameter 7, and to first oder can be written as 



Ae = 7 sin U. (9) 

For sufficiently edge-on and/or eccentric orbits the propagation delay suffered by the 
pulsar signals in the gravitational field of the companion becomes important. This 
so-called Shapiro delay, to first order, reads 

As = — 2rln |^1 — ^cos U — s sino;(cos U — e) — s coso;(l — sin I/j , 

( 10 ) 
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where the two post-Keplerian parameters r and 5' are called range and shape of the 
Shapiro delay. The latter is linked to the inclination of the orbit with respect to the 
line of sight, i,by s = sin i. It is important to note, that for i 90° Eq. (10) breaks 
down and higher order corrections are needed. But so far, Eq. (10) is fully sufficient 
for the timing observations of known pulsars [55]. 

Concerning the post-Keplerian parameters related to quasi- stationary effects, for 
the wide class of boost-invariant gravity theories one finds that they can be expressed 
as functions of the Keplerian parameters, the masses, and parameters generically 
accounting for gravitational self-field effects (cf. Eq. (2)) [5, 39]: 



Co 

7 

r 

s 



rib 



\-e^ 
e 



' ^ ^ 1 
^“2 + 2 



n 
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rib 

1 -|- 5oc Gocrric 
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M O ’ 



( 11 ) 

( 12 ) 

(13) 

(14) 



plus from Eq. (5). Here we have listed only those parameters that play a role in 
this article. Eor a complete list and a more detailed discussion, the reader is referred 
to [39]. The quantities Gqc and eoc are related to the interaction of the companion 
with a test particle or a photon. The parameter JC^ accounts for a possible change in 
the moment of inertia of the pulsar due to a change in the local gravitational constant. 



= G, £ 


— ^Oc — 




= 1 and KC^p 


= 0. Consequently 




3nb 
1 — 


vl 




(15) 


II 

o 


— [l + 


Wc1 
M \ 


\n^Yl 

n O’ 


(16) 


L 




M 


^GR ^ 


Grtic 






(17) 




M 

X Ub 


c 




(18) 




Me 


Vb 







These parameters are independent of the internal structure of the neutron star(s), 
due to the effacement of the internal structure, a property of GR [34, 37]. Eor most 
alternative gravity theories this is not the case. Eor instance, in the mono-scalar-tensor 
theories Ti(ao, (3 q) of [56, 57], one finds^ 



^The mono-scalar- tensor theories T\ (ao, /?o) of [56, 57] have a conformal coupling function A ((/?) = 
— ifo) — v?o)^/2. The Jordan-Fierz-Brans-Dicke gravity is the sub-class with /?o = 0, 

and al = (2o;bd + 3)“k 
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( 20 ) 

( 21 ) 

( 22 ) 



where Vb = [G=j,(l + apac)Mrib]^^^ . The body-dependent quantities ap and ac 
denote the effective scalar coupling of pulsar and companion respectively, and 
(3a = daAldifQ where (^o denotes the asymptotic value of the scalar field at spa- 
tial infinity. The quantity kp is related to the moment of inertia Ip of the pulsar via 
kp = —din IpIdcpQ. For a given equation of state, the parameters a a, (3 a, and kA 
depend on the fundamental constants of the theory, e.g. ao and Pq in Ti(ao, Pq), 
and the mass of the body. As we will demonstrate later, these “gravitational form 
factors” can assume large values in the strong gravitational fields of neutron stars. 
Depending on the value of Po, this is even the case for a vanishingly small ao, where 
there are practically no measurable deviations from GR in the Solar system. In fact, 
even for ao = 0 ,sl neutron star, above a certain /3o-dependent critical mass, can have 
an effective scalar coupling a a of order unity. This non-perturbative strong-field 
behavior, the so-called “spontaneous scalarization” of a neutron star, was discovered 
20 years ago by Damour and Esposito-Farese [56]. 

Finally, there is the post-Keplerian parameter Pb, related to the damping of the 
orbit due to the emission of gravitational waves. We have seen above that in alternative 
gravity theories the back reaction from the gravitational wave emission might enter 
the equations of motion already at the 1.5 post-Newtonian level, giving rise to a 
Pb oc V^lc^. To leading order one finds in mono-scalar-tensor gravity the dipolar 
contribution from the scalar field [57-59]: 



h 



= — 27T 



mpiric 



1 + e^/2 Vl (ap - acf 
(1 — ^^)^/^ 1 -h apac 



+ 0(Vi/c^). 



(23) 



As one can see, the change in the orbital period due to dipolar radiation depends 
strongly on the difference in the effective scalar coupling a a . Binary pulsar systems 
with a high degree of asymmetry in the compactness of their components are therefore 
ideal to test for dipolar radiation. An order unity difference in the effective scalar 
coupling would lead to a change in the binary orbit, which is several orders of 
magnitude (^c^lV^) stronger than the quadrupolar damping predicted by GR. For 
GR one finds from the well-known quadrup ole formula [60]: 



1927T mptric 1 + /2A + 3?//96 

m 2 (1 - g2)7/2 



( 24 ) 
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Apart from a change in the orbital period, gravitational wave damping will also 
affect other post-Keplerian parameters. While gravitational waves carry away orbital 
energy and angular momentum, Keplerian parameters like the eccentricity and the 
semi-major axis of the pulsar orbit change as well. The corresponding post-Keplerian 
parameters are e and x respectively. However, these changes affect the arrival times 
of the pulsar signals much less than the and therefore do (so far) not play a role 
in the radiative tests with binary pulsars. 

As already mentioned in Sect. 1.2, there is no generic connection between the 
higher-order gravitational wave damping effects and the parameters Q,e, and ^ of the 
modified Einstein-Infeld-Hoffmann formalism. Such higher order, mixed radiative 
and strong-field effects depend in a complicated way on the structure of the gravity 
theory [39]. 

The post-Keplerian parameters are at the foundation of many of the gravity tests 
conducted with binary pulsars. As shown above, the exact functional dependence 
differs for given theories of gravity. A priori, the masses of the pulsar and the com- 
panion are undetermined, but they represent the only unknowns in this set of equa- 
tions. Hence, once two post-Keplerian parameters are measured, the corresponding 
equations can be solved for the two masses, and the values for other post-Keplerian 
parameters can be predicted for an assumed theory of gravity. Any further post- 
Keplerian measurement must therefore be consistent with that prediction, otherwise 
the assumed theory has to be rejected. In other words, if A > 3 post-Keplerian para- 
meters can be measured, a total of A — 2 independent tests can be performed. The 
method is very powerful, as any additionally measured post-Keplerian parameter is 
potentially able to fail the prediction and hence to falsify the tested theory of gravity. 
The standard graphical representation of such tests, as will become clear below, is 
the mass-mass diagram. Every measured post-Keplerian parameter defines a curve of 
certain width (given by the measurement uncertainty of the post-Keplerian parame- 
ter) in a mp-rric diagram. A theory has passed a binary pulsar test, if there is a region 
in the mass-mass diagram that agrees with all post-Keplerian parameter curves. 



2 Gravitational Wave Damping 
2.1 The Huls e-Taylor Pulsar 

The first binary pulsar to ever be observed happened to be a rare double neutron star 
system. It was discovered by Russell Hulse and Joseph Taylor in summer 1974 
[18]. The pulsar, PSR B1913-hl6, has a rotational period of 59ms and is in a 
highly eccentric {e = 0.62) 7.75-h orbit around an unseen companion. Shortly 
after the discovery of PSR B1913-hl6, it has been realized that this system may 
allow the observation of gravitational wave damping within a time span of a few 
years [61, 62]. 
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Table 2 Observed orbital timing parameters of PSR B1913+16, based on the Damour-Deruelle 
timing model (taken from [29]) 



To 


Time of periastron passage (MID) 


52144.90097841(4) 


X 


Projected semi-major axis of the pulsar orbit (s) 


2.341782(3) 


e 


Orbital eccentricity 


0.6171334(5) 


Pb 


Orbital period at Tq (d) 


0.322997448911(4) 


LOO 


Longitude of periastron at Tq (deg) 


292.54472(6) 


UJ 


Secular advance of periastron (deg/yr) 


4.226598(5) 


7 


Amplitude of Einstein delay (ms) 


4.2992(8) 


Pb 


Secular change of orbital period 


-2.423(1) X 10“'2 



Figures in parentheses represent estimated uncertainties in the last quoted digit 



The first relativistic effect seen in the timing observations of the Hulse-Taylor 
pulsar was the secular advance of periastron uj. Thanks to its large value of 4.2 deg/yr, 
this effect was well measured already one year after the discovery [63]. Due to the, 
a priori, unknown masses of the system, this measurement could not be converted 
into a quantitative gravity test. However, assuming GR is correct, Eq. (15) gives 
the total mass M of the system. From the modem value given in Table 2 one finds 
M = mp+mc = 2.828378 ± 0.000007 M© [29].^ 

It took a few more years to measure the Einstein delay (9) with good precision. 
In a single orbit this effect is exactly degenerate with the Roemer delay, and only 
due to the relativistic precession of the orbit these two delays become separable [61, 
65]. By the end of 1978, the timing of PSR B1913+16 yielded a measurement of 
the post-Keplerian parameter 7, which is the amplitude of the Einstein delay [66]. 
Together with the total mass from Eq. (16) can now be used to calculate the 
individual masses. With the modem value for 7 from Table 2, and the total mass 
given above, one finds the individual masses ntp = 1.4398 zb 0.0002 Mq and rric = 
1.3886 zb 0.0002 Mq for pulsar and companion respectively [29]. 

With the knowledge of the two masses, nip and the binary system is fully 
determined, and further GR effects can be calculated and compared with the observed 
values, providing an intrinsic consistency check of the theory. In fact, Taylor et al. 
[66] reported the measurement of a decrease in the orbital period Pb, consistent with 
the quadrupole formula (24). This was the first proof for the existence of gravitational 
waves as predicted by GR. In the meantime the Pt is measured with a precision of 
0.04 % (see Table 2). However, this is not the precision with which the validity of the 
quadrupole formula is verified in the PSR B 1913+16 system. The observed Pb needs 
to be corrected for extrinsic effects, most notably the differential Galactic acceleration 
and the Shklovskii effect, to obtain the intrinsic value caused by gravitational wave 



^Strictly speaking, this is the total mass of the system scaled with an unknown Doppler factor D, 
i.e. = /)-iMintnnsic pqj p^j. typical velocities, D — 1 is expected to be of order 10“'^, 

see for instance [64]. In gravity tests based on post-Keplerian parameters, the factor D drops out 
and is therefore irrelevant [54] . 
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damping [67, 68]. The extrinsic contribution due to the Galactic gravitational field 
(acceleration g) and the proper motion (transverse angular velocity in the sky /i) are 
given by 



= 



Pb 



Ko • (gpsR - go) + pd 



]■ 



(25) 



where Kq is the unit vector pointing towards the pulsar, which is at a distance d 
from the Solar system. For PSR B1913+16, and Kq are measured with very 
high precision, and also fi is known with good precision ("^8 %). However, there is 
a large uncertainty in the distance J, which is also needed to calculate the Galactic 
acceleration of the PSR B1913+16 system, gpsR, in Eq. (25). Due to its large distance, 
there is no direct parallax measurement for J, and estimates of d are based on model- 
dependent methods, like the measured column density of free electrons between 
PSR B1913-P16 and the Earth. Such methods are known to have large systematic 
uncertainties, and for this reason the distance to PSR B 1913-1-16 is not well known: 
J = 9.9 ib 3.1kpc [29, 69]. In addition, there are further uncertainties, e.g. in the 
Galactic gravitational potential and the distance of the Earth to the Galactic center. 
Accounting for all these uncertainties leads to an agreement between 
and P^^ at the level of about 0.3 % [29]. The corresponding mass-mass diagram is 
given in Eig. 4. As the precision of the radiative test with PSR B 19 13-1-16 is limited 
by the model-dependent uncertainties in Eq. (25), it is not expected that this test can 
be significantly improved in the near future. 



Fig. 4 Mass-mass diagram 
for PSR B1913-f 16 based on 
GR and the three observed 
post-Keplerian parameters Co 
{black), 7 {red) and Pb 
{blue). The dashed Pb curve 
is based on the observed Pb, 
without corrections for 
Galactic and Shklovskii 
effects. The solid Pb curve is 
based on the corrected 
(intrinsic) Pb, where the thin 
lines indicate the one- sigma 
boundaries. Values are taken 
from Table 2 
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Fig. 5 Shift in the time of 
periastron passage of 
PSRB1913+16due to 
gravitational wave damping. 
The parabola represents the 
GR prediction and the data 
points the timing 
measurements, with 
{vertical) error bars mostly 
too small to be resolved. The 
observed shift in periastron 
time is a direct measurement 
of the change in the 
world-line of the pulsar due 
to the back-reaction of the 
emitted gravitational waves 
(cf. Figures). The 
corresponding spatial shift 
amounts to about 20,000 km. 
Figure is taken from [29] 




Finally, besides the mass-mass diagram, there is a different way to illustrate the 
test of gravitational wave damping in PSR B1913-hl6. According to Eq. (7), the 
change in the orbital period, i.e. the post-Keplerian parameter is measured from 
a shift in the time of periastron passage, where U is a. multiple of 27 t. One finds for 
the shift in periastron time, as compared to an orbit with zero decay 

AT = ipbPbti^ + O(PbPin^), (26) 

where n = 0 , 1 , 2 , . . . denotes the number of the periastron passage, and is given 
by ^ ~ (r — To)/Ph. Equation (26) represents a parabola in time, which can be 
calculated with high precision using the masses that come from and 7 ^^ (see 
above). On the other hand, the observed cumulative shift in periastron can be extracted 
from the timing observations with high precision. A comparison of observed and 
predicted cumulative shift in the time of the periastron passage is given in Fig. 5. 



2.2 The Double Pulsar — The Best Test for Einstein^s 
Quadrupole Formula^ and More 

In 2003 a binary system was discovered where, at first, one member was identified as 
a pulsar with a 23 ms period [70], before about half a year later, the companion was 
also recognized as a radio pulsar with a period of 2.8 s [71]. Both pulsars, known as 
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PSRs J0737— 3039A and J0737— 3039B, respectively, (or A and B hereafter), orbit 
each other in less than 2.5 h in a mildly eccentric (e = 0.088) orbit. As a result, the 
system is not only the first and only double neutron star system where both neutron 
stars are visible as active radio pulsars, but it is also the most relativistic binary pulsar 
laboratory for gravity known to date [72]. Just to give an example for the strength 
of relativistic effects, the advance of periastron, uj, is 17 degrees per year, meaning 
that the eccentric orbit does a full rotation in just 21 years. In this subsection, we 
briefly discuss the properties of this unique system, commonly referred to as the 
Double Pulsar, and highlight some of the gravity tests that are based on the radio 
observations of this system. For detailed reviews of the Double Pulsar see [72, 73]. 

In the Double Pulsar system a total of six post-Keplerian parameters have been 
measured by now. Five arise from four different relativistic effects visible in pulsar 
timing [74], while a sixth one can be determined from the effects of geodetic preces- 
sion, which will be discussed in detail below. The relativistic precession of the orbit, 
(Jj, was measured within a few days after timing of the system commenced, and by 
2006 it was already known with a precision of 0.004 % (see Table 3). At the same time 
the measurement of the amplitude of Einstein delay, 7, reached 0.7 % (see Table 3). 
Due to the periastron precession of 17 degrees per year, the Einstein delay was soon 
well separable from the Roemer delay. Two further post-Keplerian parameters came 
from the detection of the Shapiro delay: the shape and range parameters 5 ’ and r. 
They were measured with a precision of 0.04 and 5 %, respectively (see Table 3). 



From the measured value 5 ' = sin i = 0.99974 



+ 0.00016 

- 0.00039 



(/ 



88.7°^q 30 ) one can 



already see how exceptionally edge-on this system is. Finally, the decrease of the 
orbital period due to gravitational wave damping was measured with a precision of 
1.4 % just three years after the discovery of the system (see Table 3). 

A unique feature of the Double Pulsar is its nature as a “dual-line source”, i.e. we 
measure the orbits of both neutron stars at the same time. Obviously, the sizes of 
the two orbits are not independent from each other as they orbit a common center 



Table 3 A selection of observed orbital timing parameters of the Double Pulsar, based on the 
Damour-Deruelle timing model (taken from [74]) 



xa = Cl A sin i jc 


Projected semi-major axis of A (s) 


1.415032(1) 


xb = ub sin//c 


projected semi-major axis of (s) 


1.5161(16) 


e 


Orbital eccentricity 


0.0877775(9) 


Pb 


Orbital period (d) 


0.10225156248(5) 


UJ 


Secular advance of periastron (deg/yr) 


16.89947(68) 


7 


amplitude of Einstein delay for A (ms) 


0.3856(26) 


Pb 


Secular change of orbital period 


-1.252(17) X 10-'2 


s 


Shape of Shapiro delay for A 


0.99974(-39, +16) 


r 


Range of Shapiro delay for A (/is) 


6.21(33) 



All post-Keplerian parameters below are obtained from the timing of pulsar A. The timing precision 
for pulsar B is considerably lower, and allows only for a, in comparison, low precision measurement 
(~0.3 %) of UJ [74]. Figures in parentheses represent estimated uncertainties in the last quoted digit 
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of mass. In GR, up to first post-Newtonian order the relative size of the orbits is 
identical to the inverse ratio of masses. Hence, by measuring the orbits of the two 
pulsars (relative to the centre of mass), we obtain a precise measurement of the mass 
ratio. This ratio is directly observable, as the orbital inclination angle is obviously 
identical for both pulsars, i.e. 



This expression is not just limited to GR. In fact, it is valid up to first post-Newtonian 
order and free of any explicit strong-field effects in any Lorentz-invariant theory of 
gravity (see [40] for a detailed discussion). Using the parameter values of Table 3, one 
finds that in the Double Pulsar the masses are nearly equal with R = 1.0714ib0.0011. 

As it turns out, all the post-Keplerian parameters measured from timing are con- 
sistent with GR. In addition, the region of allowed masses agrees well with the 
measured mass ratio R (see Fig. 6). One has to keep in mind, that the test presented 
here is based on data published in 2006 [74]. In the meantime continued timing lead 
to a significant decrease in the uncertainties of the post-Keplerian parameters of the 
Double pulsar. This is especially the case for for which the uncertainty typically 
decreases with [75], Tobs being the total time span of timing observations. The 
new results will be published in an upcoming publication (Kramer et al., in prep.). 




niA ciB ciB sin i /c _ xb 



(27) 
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Fig. 6 GR mass-mass diagram based on timing observations of the Double Pulsar. The orange 
areas are excluded simply by the fact that sin/ < 1. The figure is taken from [72] (^^so lines 
removed) and based on the timing solution published in [74] 
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As reported in [33], presently the Double Pulsar provides the best test for the GR 
quadrupole formalism for gravitational wave generation, with an uncertainty well 
below the 0.1 % level. As discussed above, the Hulse-Taylor pulsar is presently lim- 
ited by uncertainties in its distance. This raises the valid question, at which level such 
uncertainties will start to limit the radiative test with the Double Pulsar as well. Com- 
pared to the Hulse-Taylor pulsar, the Double Pulsar is much closer to Earth. Because 
of this, a direct distance estimate of 1.15_Q'^^kpc based on a parallax measurement 
with long-baseline interferometry was obtained [76]. Thus, with the current accuracy 
in the measurement of distance and transverse velocity, GR tests based on can be 
taken to the 0.01 % level. 

With the large number of post-Keplerian parameters and the known mass ratio, the 
Double Pulsar is the most over-constrained binary pulsar system. For this reason, one 
can do more than just testing specific gravity theories. The Double Pulsar allows for 
certain generic tests on the orbital dynamics, time dilation, and photon propagation 
of a spacetime with two strongly self-gravitating bodies [72]. First, the fact that the 
Double Pulsar gives access to the mass ratio, R, in any Forentz-invariant theory of 
gravity, allows us to determine mp^lM = P/(l -h P) = 0.51724 zb 0.00026 and 
m^jM = 1/(1 -h P) = 0.48276 zb 0.00026. With this information at hand, the 
measurement of the shape of the Shapiro delay 5 ' can be used to determine via 
Eq. (14): V^/c = (2.0854 zb 0.0014) x 10“^. At this point, the measurement of 
the post-Keplerian parameters cj, 7 , and r (Eqs. (11)-(13)) can be used to impose 
restrictions on the “strong-field” parameters of Fagrangian (2) [72]: 





= 0.9995 zb 0.0016, 


(28) 


5 


<^0B 


= 1.005 zb 0.010, 


(29) 


eoB + 1 Gob 


= 1.009 zb 0.054. 


(30) 


4 g 



This is in full agreement with GR, which predicts one for all three of these expres- 
sions. Consequently, nature cannot deviate much from GR in the quasi- stationary 
strong-field regime of gravity (G2 in Fig. 1). 



2.3 PSR J1738+0333 — The Best Test for Scalar-Tensor 
Gravity 

The best “pulsar clocks” are found amongst the fully recycled millisecond pulsars, 
which have rotational periods less than about 10ms (see e.g. [77]). A result of the 
stable mass transfer between companion and pulsar in the past — responsible for the 
recycling of the pulsar — is a very efficient circularization of the binary orbit, that 
leads to a pulsar- white dwarf system with very small residual eccentricity [78]. For 
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such systems, the post-Keplerian parameters Co and 7 are generally not observable. 
There are a few cases where the orbit is seen sufficiently edge-on, so that a measure- 
ment of the Shapiro delay gives access to the two post-Keplerian parameters r and 5 ' 
with good precision (see e.g. [79], which was the first detection of a Shapiro delay in 
a binary pulsar). With these two parameters the system is then fully determined, and 
in principle can be used for a gravity test in combination with a third measured (or 
constrained) post-Keplerian parameter (e.g. Pb). Besides the Shapiro delay parame- 
ters, some of the circular binary pulsar systems offer a completely different access 
to their masses, which is not solely based on the timing observations in the radio 
frequencies. If the companion star is bright enough for optical spectroscopy, then 
we have a dual-line system, where the Doppler shifts in the spectral lines can be 
used, together with the timing observations of the pulsar, to determine the mass ratio 
R. Furthermore, if the companion is a white dwarf, the spectroscopic information 
in combination with models of the white dwarf and its atmosphere can be used to 
determine the mass of the white dwarf ultimately giving the mass of the pulsar 
via Mp = R Me. As we will see in this and the following subsection, two of the best 
binary pulsar systems for gravity tests have their masses determined through such a 
combination of radio and optical astronomy. 

PSR J1738-h0333 was discovered in 2001 [80]. It has a spin period P of 5.85 ms 
and is a member of a low-eccentricity (e < 4 x 10~^) binary system with an orbital 
period P^ of just 8.5 h. The companion is an optically bright low-mass white dwarf. 
Extensive timing observation over a period of 10 years allowed a determination of 
astrometric, spin and orbital parameters with high precision [28], most notably 

• A change in the orbital period of (—17.0 zb 3.1) x 10~^^. 

• A timing parallax, which gives a model independent distance estimate of d = 

1.47ib0.10kpc. 

The latter is important to correct for the Shklovskii effect and the differential Galactic 
acceleration to obtain the intrinsic Pt (cf. Eq. (25)). Additional spectroscopic obser- 
vations of the white dwarf gave the mass ratio R = 8.1 zb 0.2 and the companion 
mass rUc = 0.181^q qq^ Mq, and consequently the pulsar mass nip = 1.47^q Mq 
[ 81]. It is important to note, that the mass determination for PSR B1738-b0333 is 
free of any explicit strong-field contributions, since this is the case for the mass ratio 
[40], and certainly for the mass of the white dwarf, which is a weakly self-gravitating 
body, i.e. a gravity regime that has been well tested in the Solar system (G1 in Pig. 1). 

After using Eq. (25) to correct for the Shklovskii contribution, 6Pb = Ph^dje — 
( 8 . 3 ^ 03 ) X 10“^^, and the contribution from the Galactic differential acceleration, 
^P^ = (0.58^q I 4 ) X 10 “^^, one finds an intrinsic orbital period change due to 
gravitational wave damping of P^^^ — (—25.9 zb 3.2) x 10“^^. This value agrees 
well with the prediction of GR, as can be seen in Pig. 7. 

The radiative test with PSR J1738-b0333 represents a ~15 % verification of GR’s 
quadrupole formula. A comparison with the <0. 1 % test from the Double Pulsar (see 
Sect. 2.2) raises the valid question of whether the PSR J1738-b0333 experiment is 
teaching us something new about the nature of gravity and the validity of GR. To 
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Fig. 7 GR mass-mass 
diagram based on the timing 
observations of 
PSRJ1738-f0333 and the 
optical observations of its 
white-dwarf companion 



respectively. The thin lines g 

indicate the one-sigma errors Jg 

of the measured parameters. ^ 

The grey area is excluded by 
the condition sin / < 1 ^ 




address this question, let’s have a look at Eq. (23). Dipolar radiation can be a strong 
source of gravitational wave damping, if there is a sufficient difference between the 
effective coupling parameters ap and of pulsar and companion respectively. For 
the Double Pulsar, where we have two neutron stars with rup ~ one generally 
expects that ~ and therefore the effect of dipolar radiation would be strongly 

suppressed. On the other hand, in the PSR J1738+0333 system there is a large 
difference in the compactness of the two bodies. For the weakly self-gravitating 
white-dwarf companion — cto. i-O. it assumes the weak-field value,^ while the 
strongly self-gravitating pulsar can have an ap that significantly deviates from ao. In 
fact, as discussed in Sect. 1 .4, ap can even be of oder unity in the presence of effects 
like strong-field scalarization. In the absence of non-perturbative strong-field effects 
one can do a first order estimation (ap — ac) oc (ep — e^) + 0(e^). For the Double 
Pulsar one finds (cp — ~ 6 x 10“^, which is significantly smaller than for the 

PSR J1738-h0333 system, which has (ep — ~ 0.012.^ As a consequence, the 

orbital decay of asymmetric systems like PSR J1738-h0333 could still be dominated 
by dipolar radiation, even if the Double Pulsar agrees with GR. For this reason, 
PSR J 173 8-1-033 3 is particularly useful to test gravity theories that violate the strong 
equivalence principle and therefore predict the emission of dipolar radiation. A well 
known class of gravity theories, where this is the case, are scalar- tensor theories. As 
it turns out, PSR J1738-h0333 is currently the best test system for these alternatives 
to GR (see Fig. 8). In terms of Eq. (23), one finds 

\ap — ad < 2 X 10“^ (95 % confidence), (31) 



^From the Cassini experiment [11] one obtains |aol < 3 x 10 ^ (95 % confidence). 

^These numbers are based on the equation of state MPAl in [82] . Within GR, MPAl has a maximum 
neutron-star mass of 2.46 M©, which can also account for the high-mass candidates of [83-85]. 
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Fig. 8 Constraints on the class of T\ (ao , /?o) scalar-tensor theories of [56, 57] , from different binary 
pulsar and Solar system (Cassini and Lunar Laser Ranging) experiments. The grey area indicates 
the still allowed T\ theories, and includes GR (ao = Po = 0). It is obvious that PSR J1738-I-0333 is 
the most constraining experiment for most of the Po range, and is even competitive with Cassini in 
testing the Jordan-Fierz-Brans-Dicke theory (Pq = 0). As can be clearly seen, the double neutron- 
star systems PSR B 1534+ 12 [86], PSR B 1913+ 16 (Hulse-Taylor pulsar) and PSR J0737-3039A/B 
(Double Pulsar) are considerably less constraining, as explained in the text. Figure is taken from [28] 

where for the weakly self-gravitating white dwarf companion ac — cto- This limit 
can be interpreted as a generic limit on dipolar radiation, where ap — ac is the 
difference of some hypothetical (scalar- or vector-like) “gravitational charges” [38]. 



2.4 PSR J0348+0432 — A Massive Pulsar 
in a Relativistic Orbit 

PSR J0348+0432 was discovered in 2007 in a drift scan survey using the Green 
Bank radio telescope (GBT) [87, 88]. PSR J0348+0432 is a mildly recycled radio- 
pulsar with a spin period of 39 ms. Soon it was found to be in a 2.46-h orbit with a 
low-mass white-dwarf companion. In fact, the orbital period is only 15 s longer than 
that of the Double Pulsar, which by itself makes this already an interesting system 
for gravity. Initial timing observations of the binary yielded an accurate astrometric 
position, which allowed for an optical identification of its companion [89]. As it 
turned out, the companion is a relatively bright white dwarf with a spectrum that 
shows deep B aimer lines. Like in the case of PSR J 173 8+03 33, one could use 
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high-resolution optical spectroscopy to determine the mass ratio R = 11.70 zb 0.13 
and the companion mass rric = 0. 172zb0.003 Mq . For the mass of the pulsar one then 
finds = R Me = 2.01zb0.04MQ, which is presently the highest, well determined 
neutron star mass, and only the second neutron star with a well determined mass close 
to 2 MqJ 

Since the discovery of PSR J0348-h0432 there have been regular timing obser- 
vations with three of the major radio telescopes in the world, the 100-m Green 
Bank Telescope, the 305-m radio telescope at the Arecibo Observatory, and the 
100-m Effelsberg radio telescope. Based on the timing data, in 2013 Antoniadis 
et al. [89] reported the detection of a decrease in the orbital period of Ph = 
(—2.73 zb 0.45) zb 10“^^ that is in full agreement with GR. In numbers: 

= 1.05 ±0.18. (32) 

As it turns out, using the distance inferred from the photometry of the white dwarf 
(d ~ 2.1kpc) corrections due to the Shklovskii effect and differential acceleration 
in the Galactic potential (see Eq. (25)) are negligible compared to the measurement 
uncertainty in Pt. 

Like PSR 1738+0333, PSR J0348+0432 is a system with a large asymmetry in the 
compactness of the components, and therefore well suited for a dipolar radiation test. 
Using Eq. (23), the limit (32) can be converted into a limit on additional gravitational 
scalar or vector charges: 

|ct /7 — ctol < 5 X 10“^ (95 % confidence). (33) 

This limit is certainly weaker than the limit (32), but it has a new quality as it tests a 
gravity regime in neutron stars that has not been tested before. Gravity tests before 
[89] were confined to “canonical” neutron star masses of ~ 1 .4 Mq . PSR J0348+0432 
for the first time allows a test of the relativistic motion of a massive neutron star, 
which in terms of gravitational self-energy lies clearly outside the tested region (see 
Fig. 9). 

Although an increase in fractional binding energy of about 50% does not seem 
much, in the highly non-linear gravity regime of neutron stars it could make a signif- 
icant difference. To demonstrate this, [89] used the scalar-tensor gravity Ti(ao, Pq) 
of [56, 57], which is known to behave strongly non-linear in the gravitational fields 
of neutron stars, in particular for Po < —4.0. As shown in Fig. 10, PSR J0348+0432 
excludes a family of scalar-tensor theories that predict significant deviations from 
GR in massive neutron stars and were not excluded by previous experiments, most 
notably the test done with PSR J1738+0333. 

With PSR J0348+0432, gravity tests now cover a range of neutron star masses 
from 1.25 Mq (PSR J0737— 3039B) to 2Mq. No significant deviation from GR in 
the orbital motion of these neutron stars was found. These findings have interesting 



^The first well determined two Solar mass neutron star is PSR 11614—2230 [90], which is in a wide 
orbit and therefore does not provide any gravity test. 



674 



N. Wex 




riiNs (Solar mass) 



Fig. 9 Fractional gravitational binding energy of a neutron star as a function of its (inertial) 
mass, based on equation of state MPAl [82]. The plot clearly shows the prominent position of 
PSR J0348+0432. The other dots indicate the neutron star masses of the individual test systems in 
Fig. 8 

implications for the upcoming ground-based gravitational wave experiments, as a 
significant amount of dipolar radiation would drive the phase evolution of the merging 
binary many cycles away from the GR template and, consequently, degrade the ability 
to accurately determine the parameters of the merging system or even prevent the 
detection of the signal. A detailed discussion can be found in [89]. 



3 Geodetic Precession 



A few months after the discovery of the Hulse-Taylor pulsar, Damour and Ruffini 
[51] proposed a test for geodetic precession in that system. If the pulsar spin is 
sufficiently tilted with respect to the orbital angular momentum, the spin direction 
should gradually change over time (see Sect. 1.3). A change in the orientation of the 
spin-axis of the pulsar with respect to the line-of-sight should lead to changes in the 
observed pulse profile. These pulse-profile changes manifest themselves in various 
forms [92], such as changes in the amplitude ratio or separation of pulse components 
[93, 94], the shape of the characteristic swing of the linear polarization [95], or the 
absolute value of the position angle of the polarization in the sky [72]. In principle, 
such changes could allow for a measurement of the precession rate and by this yield 
a test of GR. In practice, it turned out to be rather difficult to convert changes in the 
pulse profile into a quantitative test for the precession rate. Indeed, the Hulse-Taylor 
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Fig. 10 Effective scalar coupling as a function of the neutron-star mass, in the T\ (o^o, Po) mono- 
scalar-tensor gravity theory of [56, 57]. For the linear coupling of matter to the scalar field we have 
chosen ao = 10“^, a value well below the sensitivity of any near-future Solar system experiment, 
like GAIA [91]. The blue curves correspond to stable neutron-star configurations for different values 
of the quadratic coupling —5 to —4 {top to bottom) in steps of 0.1. The yellow area indicates 
the parameter space still allowed by the limit (31) [label ‘J1738’], whereas only the green area is 
in agreement with the limit (33) [label ‘J0348’]. The plot shows clearly how the massive pulsar 
PSR J0348+0432 probes deep into a new gravity regime. Neutron-star calculations are based on 
equation of state MPAl [82] (see [89] for a different equation-of- state) 



pulsar, in spite of prominent profile changes due to geodetic precession [93, 94], 
does not (yet) allow for a quantitative test of geodetic precession. This is mostly 
due to uncertainties in the orientation of the magnetic axis and the intrinsic beam 
shape [96]. 

Profile and polarization changes due to geodetic precession have been observed in 
other binary pulsars as well [97, 98], but again did not lead to a quantitative gravity 
test. A complete list of binary pulsars that up to date show signs of geodetic precession 
can be found in [33]. Out of the six pulsars listed in [33], so far only two allowed 
for quantitative constraints on their rate of geodetic precession: PSR B 1534+ 12 [95] 
and pulsar B of the Double Pulsar [99] . In the following we will discuss the latter 
in more details, as it currently provides the best test for the geodetic precession of a 
binary pulsar. 

In Sect. 2.2, we have seen the Double Pulsar as one of the most exciting “labora- 
tories” for relativistic gravity, with a wealth of relativistic effects measured, allowing 
the determination of 5 post-Keplerian parameters from timing observations: cj, 7, 
Ph, r, 5'. Calculating the inclination angle of the orbit i from 5 ' = sin/, one finds 
that the line-of-sight is inclined with respect to the plane of the binary orbit by just 
about 1.3° [74]. As a consequence, during the superior conjunction the signals of 
pulsar A pass pulsar 5 at a distance of only 20,000 km. This is small compared to 
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the extension of pulsar B's magnetosphere, which is roughly given by the radius 
of the light-cylinder^ r\c = cP !2 ti ~ 130,000 km. And indeed, at every superior 
conjunction pulsar A gets eclipsed for about 30 s due to absorption by the plasma in 
the magnetosphere of pulsar B [71]. A detailed analysis revealed that during every 
eclipse the light curve of pulsar A shows flux modulations that are spaced by half or 
integer numbers of pulsar 5’s rotational period [100]. This pattern can be understood 
by absorbing plasma that co-rotates with pulsar B and is confined within the closed 
field lines of the magnetic dipole of pulsar B. As such, the orientation of pulsar 5’s 
spin is encoded in the observed light curve of pulsar A [99]. Over the course of 
several years, Breton et al. [99] observed characteristic shifts in the eclipse pattern, 
that can be directly related to a precession of the spin of pulsar B. From this analysis, 
Breton et al. were able to derive a precession rate of 

4 77+0.66 (34) 

The measured rate of precession is consistent with that predicted by GR = 

5.07deg/yr) within its one-sigma uncertainty. This is the sixth(!) post-Keplerian 
parameter measured in the Double-Pulsar system. Furthermore, for the coupling 
function F^, which parametrizes strong-field deviation in alternative gravity theories 
(see Eq. (5)), one finds 



T^/g = 1.90 zb 0.22, (35) 

which agrees with the GR value F^/ G = 2. Although the geodetic precession of 
a gyroscope was confirmed to better than 0.3 % by the Gravity Probe B experiment 
[13], the clearly less precise test with Double Pulsar 5 (13 %) for the first time gives 
a good measurement of this effect for a strongly self-gravitating “gyroscope”, and 
by this represents a qualitatively different test. 

The geodetic precession of pulsar B not only changes the pattern of the flux 
modulations observed during the eclipse of pulsar A, it also changes the orientation 
of pulsar 5’s emission beam with respect to our line-of- sight. As a result of this, 
geodetic precession has by now turned pulsar B in such a way, that since 2009 
it is no longer seen by radio telescopes on Earth [101]. From their model, Perera 
et al. [101] predicted that the reappearance of pulsar A is expected to happen around 
2035 with the same part of the beam, but could be as early as 2014 if one assumes a 
symmetric beam shape. 

Finally, for pulsar A GR predicts a precession rate of 4.78deg/yr, which is com- 
parable to that of pulsar B. However, since the light-cylinder radius of pulsar A 
(~ 1000 km) is considerably smaller than that of pulsar B, there are no eclipses that 
could give insight into the orientation of its spin. Moreover, long-term pulse pro- 
file observations indicate that the misalignment between the spin of pulsar A and the 
orbital angular momentum is less than 3.2° (95 % confidence) [102]. For such a close 
alignment, geodetic precession is not expected to cause any significant changes in 



The light-cylinder is defined as the surface where the co-rotating frame reaches the speed of light. 
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the spin direction (cf. Eqs. (4) and (5)). This, on the other hand, is good news for tests 
based on timing observations. One does not expect a complication in the analysis of 
the pulse arrival times due to additional modeling of a changing pulse profile. 



4 Local Lorentz Invariance of Gravity 

Some alternative gravity theories allow the Universal matter distribution to single 
out the existence of a preferred frame, which breaks the symmetry of local Lorentz 
invariance (LLI) for the gravitational interaction. In the post-Newtonian parame- 
trization of semi-conservative gravity theories, LLI violation is characterized by two 
parameters, a\ and ct 2 [5]. Non-vanishing a\ and ct2 modify the dynamics of self- 
gravitating systems that move with respect to the preferred frame (preferred-frame 
effects). In GR one finds a\ = a 2 = 0. 

As the most natural preferred frame, generally one chooses the frame associated 
with the isotropic cosmic microwave background (CMB), meaning that the preferred 
frame is assumed to be fixed by the global matter distribution of the Universe. Lrom 
the five-year Wilkinson Microwave Anisotropy Probe (WMAP) satellite experiment, 
a CMB dipole measurement with high precision was obtained [ 103] . The CMB dipole 
corresponds to a motion of the Solar system with respect to the CMB with a velocity 
of 369.0ib0.9km/s in direction of Galactic longitude and latitude (/, b) = (263.99°± 
0.14°, 48.26° zb 0.03°). The numbers quoted in the next two sub-sections, will be 
with respect to the CMB frame. A generalization to other frames is straightforward, 
and was done in some of the references cited below. 

The most important (weak-field) constraints on preferred-frame effects do come 
form Lunar Laser Ranging (LLR) [104], 

«i = (-0.7 ± 1.8) X 10“"^ (95 %CL), (36) 

and the alignment of the Sun’s spin with the total angular momentum of the planets 
in the Solar system [105], 



|o;2l < 2.4 X 10“’. 



(37) 



4.1 Constraints on aifrom Binary Pulsars 

In binary pulsars, the isotropic violation of Lorentz invariance in the gravitational 
sector should lead to characteristic preferred frame effects in the binary dynamics, if 
the bary center of the binary is moving relative to the preferred frame with a velocity w. 
Lor small-eccentricity binaries, the effects induced by a\ and 6 l 2 (the hat indicates 
possible modifications by strong-field effects) decouple, and can therefore be tested 
independently [106, 107]. 
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In case of a non- vanishing a\, the observed eccentricity vector e of a small- 
eccentricity binary pulsar is a vectorial superposition of a ‘rotating eccentricity’ 
eR(t) and a fixed ‘forced eccentricity’ ef\ e(t) = ef + e/?(0 [106]. The rotating 
eccentricity has a constant length e r , and rotates with the relativistic precession of 
periastron, uj, in the orbital plane. This is identical to the dynamics caused by a 
violation of the strong equivalence principle [106, 108], with the forced eccentricity 
pointing into the direction of L x w. As a consequence, the binary orbit changes 
from a less to a more eccentric configuration and back on a time scale of 



Itt 

uo 



(1140yr) 



\lday) \2 Mq) 



-Ij'h 



(38) 



where we have assumed that the true Cj does not deviate significantly from the one 
predicted by GR (Eq. (15)), an assumption that is well justified by other binary-pulsar 
experiments, like the generic tests in the Double Pulsar (cf. Sect. 2.2). 

The forced eccentricity e/7 is determined by the strength of the preferred frame 
effect. Its magnitude is approximately given by 



ep — 0.093 



Mp - me 

M 



/ M / Pb / w sini/) \ 

\2Mq) V 1 day / V300km/sj 



(39) 



where ip is the angle between w and L (see [106] for a detailed expression). The obser- 
vation of small eccentricities in binary pulsars, like e ^ 10“^ for PSR J1738+0333 
does not directly constrain a\. The orientation of the a priory unknown intrinsic tR 
could be such, that it compensates for a large e/7. If the system is sufficiently old, 
one can assume a uniform probability distribution in [0°, 360°) for 0(t). Like in the 
Damour-Schafer test for SEP, one can now set a probabilistic upper limit onep, and 
by this on by excluding 0 values close to alignment oftR and e/7. Based on this 
method, [106] found a limit of \ai\ < 5 x 10““^ with 90% confidence. 

But even if 0 happens to be close to 0°, due to the relativistic precession it will not 
remain there, and a large e p cannot remain hidden for ever. In fact, if Co is sufficiently 
large (greater than ~1° per year) a significant change in the orbital eccentricity 
should become observable over time scales of a few years, even if at the start of 
the observation there was a complete cancellation between e/? and e/7. This can be 
used to constrain a\ [107]. The best such test comes from PSR J1738+0333 (see 
Sect. 2.3). This binary pulsar is ideal for this test for several reasons: 

• The orbit has an extremely small, well constrained eccentricity of ~10“^ [28]. 

• The (calculated) relativistic precession of periastron is about 1.6deg/yr, and the 
binary has been observed by now for about 10 years [28]. Hence, 0{t) has covered 
an angle of 16° in that time. 

• The 3D velocity with respect to the Solar system is known with good precision 
from timing and optical observations, meaning that one can compute w [28, 81]. 
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• The orientation of the system is such, that the unknown angle of the ascending 
node Q has little influence on the a\ limit, hence there is no need for probabilistic 
considerations to exclude certain values of ^2 [107]. 

Consequently, PSR J1738+0333 leads to the best constraints of cti-like violations 
of the local Lorentz invariance of gravity, giving [107] 



This limit is not only five times better than the current most stringent limit on a\ 
obtained in the Solar system (cf. Eq. (36)), it is also sensitive to potential deviations 
related to the strong self-gravity of the pulsar. For non-perturbative deviations one 
can, for illustration purposes, do an expansion with respect to the fractional binding 
energy e of the neutron star. 



Since e ~ —0.1 for PSR J1738+0333, we get tight constraints for Ci, a parameter 
that is virtually unconstraint by the LLR experiment, since e — 5 x 10 for the 
Earth. 



4.2 Constraints on dcifrom Binary and Solitary Pulsars 

In the presence of a non-vanishing 0 ^ 2 , a small-eccentricity binary system experiences 
a precession of the orbital angular momentum around the fixed direction w with an 
angular frequency 



where ip is the angle between the orbital angular momentum and w [107]. In binary 
pulsars, such a precession should become visible as a secular change in the projected 
semi-major axis of the pulsar orbit, i, which is an observable timing parameter. The 
two binary pulsars PSRs J1012+5307 and J1738-I-0333 turn out to be particularly 
useful for such a test, since both of them have optically bright white dwarf com- 
panions, which allowed the determination of the masses in the system, and the 3D 
systemic velocity with respect to the preferred frame [81, 109, 110]. 

Unfortunately, in general, the orientation of a binary pulsar orbit with respect to 
w and the line-of-sight cannot be fully determined from timing observations. As a 
consequence, one cannot directly test 0^2 from observed constraints for i . In fact, since 
the longitude of the ascending node ^2 is not measured, neither for PSR J1012-h5307 
nor for PSR J1738-h0333, the orientation of these systems could in principle be such, 
that an 52-induced precession would not lead to a significant i. Assuming a random 
distribution of ^2 in the interval [0°, 360°), one can use probabilistic considerations 



a\ = —0.4^3’^ X 10 ^ (95 % confidence). 



(40) 



6l\ — OL\ -\- C\e -h O(e^). 
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(42) 
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to exclude such unfavorable orientations. A detailed discussion of this test can be 
found in [107], where the following 95 % confidence limits are derived 

\ q .2\ < 3.6 X fromPSR J1012+5307, 

|a 2 l < 2.9 X lO""^ from PSR J1738+0333, (43) 

|o; 2 | < 1.8 X 10“^ from PSRs J1012+5307 and J1738+0333 combined. 

It is important to note, that for the last limit, based on the statistical combination of 
the two systems, one has to assume that 6 l 2 has only a weak functional dependence 
on the neutron-star mass in the range of 1 .3 - 2.0 Mq. 

The limit for ol 2 obtained from binary pulsars are still several orders of magnitude 
weaker than the a 2 limit which Nordtvedt derived in 1987 from the alignment of the 
Sun’s spin with the orbital planes of the planets [105]. In the same paper, Nordtvedt 
pointed out that solitary fast-rotating pulsars could be used in a similar way to obtain 
tight constraints for a 2 . This can be directly seen from Eq. (42), which holds for a 
rotating self-gravitating star if is replaced by the rotational period P of the star. 
While the five-billion-year base-line for the Solar experiment is typically a factor 
of ~10^ longer than the observational time-span Tobs of pulsars, for millisecond 
pulsars P is ~10^ shorter than the rotational period of the Sun. In fact, the first 
millisecond pulsar PSR B 1937+21, discovered in 1982 [111], by now has a figure 
of merit Tobs/ P that is "^10 times larger than that of the Sun. 

The precession of a solitary pulsar due to a non- vanishing 6 l 2 would lead to char- 
acteristic changes in the observed pulse profile over time-scales of years, just like 
in the case of binary pulsars that experience geodetic precession (cf. Sect. 3). Con- 
sequently, a non-detection of such changes can be converted into constraints for 0(2- 
Recently, Shao et al. [112] used the two solitary millisecond pulsars PSRs B 1937+21 
and J 1744— 1 134 for such an experiment. For both pulsars they utilized a consistent 
set of data, taken over a time span of approximately 15 years with the same observing 
system at the 100-m Effelsberg radio telescope. The continuity in the observing sys- 
tem was key for an optimal comparison of the high signal-to-noise ratio profiles over 
time. As it turns out, both pulsars, PSRs B1937+21 and J1744— 1134, do not show 
any detectable profile evolution in the last 15 years. As an example of such a non- 
detection see Fig. 11, which shows two pulse profiles of PSR B 1937+21 obtained at 
different epochs. 

Similarly to the 6 l 2 test with the binary pulsars, there are unknown angles in the 
orientation of the pulsar spin, for which certain values have to be excluded based 
on probabilistic considerations. From extensive Monte-Carlo simulations Shao et al. 
found with 95 % confidence 

\ct 2 \ < 2.5 X 10“* from PSR B1937+21, 

\ct2\ < 1.5 X 10"* from PSR J1744-1134, (44) 

\ct 2 \ < 1.6 X 10“^ from PSRs B1937+21 and J1744-1134 combined. 
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pulse phase 

Fig. 11 Comparison of two pulse profiles of PSR B 1937+21 obtained at two different epochs. The 
blue one was obtained on September 2, 1997, while the red one was obtained on June 6, 2009. The 
main peak is aligned and scaled to have the same intensity. There exists no visible difference within 
the noise level. Profiles were taken from [112] 

These limits are significantly tighter than the ct2 limit from the Sun’s spin orientation. 
Like in the case of the a\ test (previous subsection), this test also covers potential 
deviations related to the strong self-gravity of the pulsar, and in the combination of the 
two pulsars, makes the assumption that 6 l 2 depends only weakly on the neutron-star 
mass. 

An important difference to the aforementioned tests with binary pulsars is, that for 
solitary pulsars one cannot determine the radial velocity. It enters the determination of 
w as a free parameter. However, as shown in [1 12], the unknown radial velocities for 
PSRs B1937+21 and J1744— 1134 only have a marginal effect on the limits. For the 
limits above it was assumed that both pulsars are gravitationally bound in the Galactic 
potential. But even if one relaxes this assumption and allows for unphysically large 
radial velocities, exceeding 1000 km/s, the limits get weaker by at most "^40 %. 



5 Summary and Outlook 

With their discovery of the first binary pulsar four decades ago, Joseph Taylor and 
Russell Hulse opened a new field of experimental gravity, which has been an active 
field of research ever since. Besides the Hulse-Taylor pulsar, which led to the first con- 
firmation of the existence of gravitational waves, astronomy has seen the discovery of 
many new binary pulsars suitable for precision gravity tests. Arguably, the most exit- 
ing discovery was the Double Pulsar in 2003, which by now provides the best test for 
GR’s quadrupole formalism of gravitational wave generation (<0.1 % uncertainty). 
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and the best test for the relativistic spin precession of a strongly self-gravitating body. 
In addition to this, it is the binary pulsar with the most post-Keplerian parameters 
measured, allowing for a number of generic constraints on strong-field deviations 
from GR. For certain aspects of gravity, binary pulsars with white dwarf companions 
have proven to be even better “test laboratories” than the Double Pulsar. These are 
gravitational phenomena, predicted by alternatives to GR, that depend on the differ- 
ence in the compactness/binding energy of the two components, like gravitational 
dipolar radiation. By now, pulsar- white dwarf systems, like PSR J1738+0333, set 
quite stringent limits (coupling strength less than about 10“^) on the existence of any 
additional “gravitational charges” associated with light or massless fields. The recent 
discovery of a massive pulsar in a relativistic binary system (PSR J0348-h0432), for 
the first time allowed to test the orbital motion of a neutron star that is significantly 
more compact than pulsars of previous gravity tests. For certain aspects of gravity, 
solitary pulsars turned out to be ideal probes. The current best limit on the PPN 
parameters 0 ^ 2 , related to the existence of a preferred frame for gravity, does come 
from pulse-profile observations of two solitary millisecond pulsars. In all these tests, 
pulsars go beyond Solar system tests, since they are also sensitive to deviations that 
occur only in the strong-field environment of neutron stars. 

So far, GR has passed all these tests with flying colors. Will this continue for ever? 
Is GR our final answer to the macroscopic description of gravity? Pulsar astronomy 
will certainly continue to investigate this question. Many of the tests mentioned here 
will simply improve by continued timing observations of the known pulsars. In fact, 
the measurement precision for some of the post-Keplerian parameters increases fast 
with time. For instance, in regular observations (with the same hardware) the uncer- 
tainty in the change of the orbital period Pb decreases with denoting the 

observing time span. Improvements in the hardware, like new broad-band receivers 
(e.g. [113]), will further boost the timing precision. For pulsars like PSR J1738-h0333 
and PSR J0348-I-0432 soon the modeling of the white dwarf will be the limiting fac- 
tor, while for the Double Pulsar the corrections of the external contributions to Pb will 
be the challenging bit, in particular if one wants to reach the "^10“^ level at which 
higher oder contributions to [1 14, 115] and the Lense-Thirring contribution to the 
orbital dynamics [44, 50] become relevant (see [72] for a detailed discussion). The 
upcoming next generation of radio telescopes, like the Five-hundred-meter Aperture 
Spherical radio Telescope (FAST) [116] and the The Square Kilometre Array (SKA) 
[117], certainly promise a big step towards this goal. With SKA, for many pulsars 
one can hope for a factor of 100 improvement in timing precision [118]. The SKA 
also promises to provide excellent direct distance measurements to pulsars, either 
directly by utilizing the long baselines of the SKA to form high angular resolution 
images, or by fitting for the timing parallax in the arrival times of the pulsar signals 
[1 19]. In combination with new models for the gravitational potential of our Galaxy, 
in particular after new missions like GAIA [120], one will be able to accurately 
determine the extrinsic “contaminations” of Pb via Eq. (25), and by this know the 
intrinsic Pb. This is key for any high precision gravitational wave test with binary 
pulsars, but also crucial to measure the Lense-Thirring drag in the Double Pulsar 
[72]. 
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Reducing the parameter uncertainties for known pulsars is one way to push gravity 
tests forward, finding new, more relativistic systems is the other. Presently there are a 
number of pulsar surveys underway that promise the discovery of many new pulsars. 
New techniques, like acceleration searches [121] and high performance computing, 
e.g. Einstein ©Home [122], promise the detection of pulsars in tight orbits, which 
generally cannot be found with traditional methods. There is considerable hope 
among pulsar astronomers, that this will finally also lead to the discovery of a pulsar- 
black hole system, occasionally called the “holy grail” of pulsar astronomy. Such a 
system is expected to provide a superb new probe of relativistic gravity and black 
hole properties, like the dragging of spacetime by the rotation of the black hole 
[123-125]. According to GR, for an astrophysical black hole (Kerr solution) there 
is an upper limit for its spin, given by 5max = GM^/c. It would pose an interesting 
challenge to GR, if the timing of a pulsar-black hole system indicates a spin S > ^max • 
But even for gravity theories that predict the same properties for black holes as in 
GR, a pulsar-black hole system would constitute an excellent test system, due to the 
high grade of asymmetry in the strong-field properties of these two components (see 
[125] for simulations based on T\ (cto, Po) scalar-tensor theories). A pulsar in a close 
orbit (Pb < I yr) around the super-massive black hole (muH 4 X 10^ Mo) in the 
center of our Galaxy would be the ultimate test system, in that context. According 
to the mock data analysis in [126], for such a system a precise measurement of the 
quadrupole moment of the black hole, and therefore a test of the no-hair theorem, 
should be possible, provided that the environment of the pulsar orbit is sufficiently 
clean. Finding and timing a pulsar in the center of our Galaxy is certainly challenging. 
A promising result in that direction is the very recent detection of radio signals from 
a magnetar near the Galactic center black hole [127], even if this pulsar is still too 
far away from the super-massive black hole ("^0.1 pc) to probe its spacetime. 

Until now, all gravitational wave tests are based on probing the near-zone of a 
binary spacetime by measuring how the back reaction of the gravitational radiation 
changes the world lines of the source masses. As outlined above, with the Double 
Pulsar this test has reached a precision of better than 0.1 %. Presently there are 
considerable efforts to achieve a direct detection of gravitational waves, i.e. measure 
the far-field properties of such radiative spacetimes by using appropriate test masses. 
Ground based laser interferometric gravitational wave observatories, like LIGO and 
VIRGO, have mirrors with separations of a few kilometers. Their sensitivity is in the 
range from 10 Hz to few 10^ Hz. Planned space-based detectors, like eLISA, will have 
three drag-free satellites as test masses with a typical separation of ~10^km, and 
should be sensitive to gravitational waves from about 10“"^ to 0. 1 Hz. For the ultra-low 
frequency band (few nano-Hz) pulsar timing arrays are currently the most promising 
detectors [128]. In these experiments the Earth/Solar system and a collection of very 
stable pulsars act as the test masses. A gravitational wave becomes apparent in a pulsar 
timing array by the changes it causes in the arrival times of the pulsar signals. Due to 
the fitting of the rotational frequency u and its time derivative z> for every pulsar, such 
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a detector is only sensitive to wavelengths up to Tobs-^ This leads to the special 
situation that the length of the “detector arms” is much larger than the wavelength. 
As a consequence, the observed timing signal contains two contributions, the so- 
called pulsar term, related to the impact of the gravitational wave on the pulsar when 
the radio signal is emitted, and the Earth term corresponding to the impact of the 
gravitational wave on the Earth during the arrival of the radio signal at the telescope 
[131, 132]. The most promising source in the nano-Hz frequency band is a stochastic 
gravitational wave background, as a result of many mergers of super-massive black 
hole binaries in the past history of the Universe [133, 134]. With the large number 
of “detector arms”, pulsar timing arrays have enough information to explore the 
properties of the nano-Hz gravitational wave background in details, once its signal 
is clearly detected in the data. Are there more than the two Einsteinian polarization 
modes (alternative metric theories can have up to six)? Is the propagation speed 
of nano-Hz gravitational waves frequency depended? Does the graviton carry mass? 
These are some of the main questions that can be addressed with pulsar timing arrays 
[135, 136]. The isolation of a single source in the pulsar timing array data would 
give us a unique opportunity to study the merger evolution of a super-massive black 
hole binary, since the signal in the Earth term and the signal in the pulsar term show 
two different states of the system, which are typically several thousand years apart 
[137]. Eor these kind of gravity experiments, however, we might have to wait till the 
full SKA has collected a few years of data, which probably brings us close to the 
year 2030. 
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Abstract We make use of an effective field-theoretical approach to derive 
post-Newtonian equations of motion of hydrodynamical inhomogeneities in cos- 
mology. The matter Lagrangian for the perturbed cosmological model includes dark 
matter, dark energy, and ordinary baryonic matter. The Lagrangian is expanded in 
an asymptotic Taylor series around a Friedmann-Lemeitre-Robertson-Walker back- 
ground. The small parameter of the decomposition is the magnitude of the metric 
tensor perturbation. Each term of the expansion series is gauge-invariant and all of 
them together form a basis for the successive post-Newtonian approximations around 
the background metric. The approximation scheme is covariant and the asymptotic 
nature of the Lagrangian decomposition does not require the post-Newtonian pertur- 
bations to be small though computationally it works the most effectively when the 
perturbed metric is close enough to the background metric. Temporal evolution of 
the background metric is governed by dark matter and dark energy and we associate 
the large-scale inhomogeneities of matter as those generated by the primordial cos- 
mological perturbations in these two components with 6p/p < 1. The small scale 
inhomogeneities are generated by the baryonic matter which is considered as a bare 
perturbation of the background gravitational field, dark matter and energy. Mathe- 
matically, the large scale structure inhomogeneities are given by the homogeneous 
solution of the post-Newtonian equations while the small scale inhomogeneities are 
described by a particular solution of these equations with the stress-energy tensor 
of the baryonic matter that admits 6p/p ^ 1. We explicitly work out the field 
equations of the first post-Newtonian approximation in cosmology and derive the 
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post-Newtonian equations of motion of the large and small scale inhomogeneities 
which generalize the covariant law of conservation of stress-energy-momentum ten- 
sor of matter in asymptotically-flat spacetime. 



1 Introduction 

1.1 Brief Overview of Perturbation Techniques 



The multiwavelength satellite observations of cosmic microwave background (CMB) 
radiation have opened a new chapter in cosmology [1]. The standard cosmological 
model has been worked out to fit the model parameters to CMB [2-5] . The agreement 
between the standard theory and CMB observations was achieved at the decisive con- 
fidence level of 95 % [6] . The Planck satellite observations provide further evidences 
in robustness of the standard model [7, 8] (see http://www.sciops.esa.int/index.php? 
project=PLANCK for a comprehensive list of Planck collaboration papers) though it 
might be still difficult to discern between various scenarios of the early universe [9] . 

Study of the formation and evolution of the large scale structure in the universe is 
a key for understanding the present state of the universe and for prediction its utter- 
most fate [10]. It is extensively researched but, as of today, remains yet unsolved 
problem in physical cosmology. It is dark matter which plays a key role in struc- 
ture formation. The dark matter consists of weakly interacting massive particles and 
interacts with baryons only by the force of gravity. The baryonic matter forms galax- 
ies which, at early stage of structure formation, simply follow the evolution of dark 
matter. Therefore, the large scale distribution of galaxies is to trace the distribution 
of dark matter. At the linear stage the density contrast 5p = p — p is much smaller 
than the background (mean) density p of the universe: 6p/p 1. At later stages 
of cosmological evolution the structure formation enters non-linear regime when 
6p/p 1 and caustics are formed. Further growth of the perturbations leads to 
formation of small scale structures like nuclei of galaxies, dwarf galaxies, globular 
clusters, stars and more compact relativistic objects which have Sp/p ^ 1. Gravi- 
tational field and matter of these super-dense baryonic objects counteract with the 
gravitational potential of dark matter and dark energy but details of this process are 
still unknown because it involves fluid’s magnetohydrodynamics, turbulence and the 
physics of strong gravity field that implies a general relativistic approach taking into 
account the non-linear interaction of gravitational field with itself and the surround- 
ing matter. Presumably, some insight to the solution of this problem can be gained 
by exploring exact Lemaitre-Tolman cosmological solution of Einstein’s equations 
admitting spatially inhomogeneity along radial coordinate [1 1-13]. This purely geo- 
metric approach is mathematically sound but physically unrealistic as it describes a 
pressureless, spherically- symmetric accretion of dust to a single point on the cosmo- 
logical manifold while the real universe has a continuous set of the accretion points 
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determined by the initial spectrum of the primordial density fluctuations [3-5] and 
the baryonic fluid pressure cannot be ignored at non-linear regime. 

It is supposed that non-linear gravitational effects in the formation of small-scale 
structure in the universe can be treated with the help of the general relativistic post- 
Newtonian approximations. They had been developed in asymptotically-flat space- 
time by a number of researchers [14-18] and were very successful in describing 
non-linear gravitational effects of fluids, for example, in derivation of equations 
of motion of binary stars [19-22], in calculation of equilibrium models of rapidly 
rotating neutron stars [23-25], etc. Post-Newtonian approximations in the baryonic 
component of the cosmological matter are more entrapped because the background 
manifold is not flat and we have to take into account not only the perturbations of 
the background metric tensor, but those of the background stress-energy tensor 
of dark matter and dark energy as well. Furthermore, the formalism of the post- 
Newtonian approximations in cosmology is to separate the contribution of bare, 
small-scale perturbations of the baryonic matter from the large-scale perturbations 
of the background matter and the metric tensor of spacetime manifold. 

A number of theoretical attempts was undertaken to work out the first post- 
Newtonian approximation and equations of motion of perfect fluid on cosmological 
manifold [26-29]. These works provide a good insight to the possible solution of 
the problem but are insufficiently consistent in separation of perturbations from their 
background values. They do not suggest a systematic approach for extending the cal- 
culations of the linear perturbation theory to the second, and higher, post-Newtonian 
approximations either. We also point out a rather surprising result by Oliynyk [30, 3 1 ] 
who analysed the general structure of the post-Newtonian expansions on cosmologi- 
cal manifold and arrived to the conclusion that the post-Newtonian series are analytic 
with respect to the small parameter e — vjc, where f is a peculiar velocity of fluid 
with respect to the Hubble flow and c is the constant fundamental speed [32] . The 
Oliynyk’ s conclusion disagrees with that established for the post-Newtonian expan- 
sions in asymptotically-flat spacetime [33-35]. 

Recently, a new interest for developing a self-consistent theory of post-Newtonian 
approximations in precision cosmology was triggered by a lively discussion [36-40] 
on whether the small-scale structure of the universe affects its Hubble expansion rate 
and, thus, can explain the cosmic acceleration of the universe discovered in 1998-99 
[41, 42] without invoking a dark energy. This is a, so-called, backreaction problem 
which intimately relates to the procedure of averaging the small-scale matter pertur- 
bations on a curved cosmological manifold [43, 44]. A certain progress in this direc- 
tion was achieved but many mathematical aspects of the backreaction problem are 
still poorly understood [45]. The task is to build a rigorous mathematical formalism 
being able to describe on equal footing both the large-scale perturbations of the back- 
ground matter of cosmological manifold with the density contrast bp j p 1 and the 

small-scale perturbations at present epoch caused by small-scale structures (galaxy, 
globular cluster, star) having the density contrast bpjp^ 1. Einstein’s equations tell 
us that the density perturbation, (5p/p, is proportional to the second derivatives of the 
metric tensor perturbation which can be very large \i bp! p^ 1. At the same time, 
the metric tensor perturbation, — gap^ and its first derivatives, 
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can still remain small enough in order to apply a perturbation technique for solving 
the Einstein equations. This is similar to the situation in the solar system where the 
matter density contrast is huge but, nonetheless, the gravitational weak-field approx- 
imation for solving Einstein’s equations is fully applicable [46]. It supports the idea 
that the perturbation technique in cosmology (under above assumptions) is valid for 
calculating physical effects of inhomogeneities on both the large and small scales 
[39, 47, 48]. The question is what mathematical technique is the most adequate to 
deal with physical applications. 

Historically, the very first perturbation scheme in cosmology was worked out 
by Lifshitz [49, 50]. It is technically convenient for calculating time evolution of 
matter’s large scale structure inhomogeneities and gravitational field perturbations 
[4, 51] but is unsuitable for discussing the process of formation and time evolution 
of the small scale structures in universe. This is because the Lifshitz approach uses 
the synchronous gauge where the time-time component of the metric tensor is fixed, 
^00 = —1, at any order of approximation. The small scale structure in cosmology 
corresponds to a localized astronomical system having a large density contrast, bpjp, 
and governed by the Newtonian law of gravity which demands the presence of the 
Newtonian potential, f/, making ^foo = — 1 + 2f//c^ . It breaks down the synchronous 
gauge condition at the regime oibpjp^ 1. 

Bardeen’s perturbation approach [52] (see also [53]) is more flexible as it admits a 
rather large freedom in choosing a particular gauge condition for solving cosmolog- 
ical problems [3, 5]. In the framework of this approach, the longitudinal (conformal- 
Newtonian) gauge is the most appropriate for discussing the small scale structure for- 
mation and its physical effects [54]. Some mathematical disadvantage of Bardeen’s 
approach is in imposing a scalar- vector-tensor decomposition on the metric tensor. It 
requires application of the Helmholtz theorem [55] that demands to foliate spacetime 
by a family of spacelike hypersurfaces and to integrate the metric tensor over these 
hypersurfaces. It makes Bardeen’s approach non-local as contrasted to Lifshitz’s per- 
turbation scheme. Moreover, in order to preserve the gauge-invariance of the Bardeen 
perturbation scheme one has to decompose the gauge functions in the same fashion 
as the metric tensor. As the universe evolves the gauge-invariance of the overall 
Bardeen’s scheme can be preserved, if and only if, one maintains the mapping of 
spatial points on the foliations along the vector field of time coordinate world lines. 
Evidently, this approximation scheme differs significantly from the post-Newtonian 
approximations in asymptotically fiat spacetime [14, 46, 56] which are more similar 
to Lifshitz’s approach but use a different gauge condition (harmonic gauge). 

A gauge-invariant alternative to Bardeen’s approach was suggested by Ellis and 
Bruni [57] (see also [58, 59]), who developed a perturbation scheme based on a 
reduction of full Einstein’s equations down to a system of field equations that are lin- 
ear around a particular background. The Ellis-Bruni approach uses gauge-invariant 
variables which are spatial projections on the local comoving-observer frame thread- 
ing the entire space-time of a real universe. Thus, the Bardeen’s foliation has been 
replaced in Ellis-Bruni approach with the frame threading and, thus, observer depen- 
dent. This is not convenient, and is not used, for developing the post-Newtonian 
approximations in asymptotically-fiat spacetime. 
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At the epoch of precise cosmology we need more transparent theoretical scheme 
of the post-Newtonian approximations for handling the iterative calculation of cos- 
mological perturbations and derivation of their equations of motion. This iterative 
scheme must satisfy a number of well-established criteria like to be covariant, gauge- 
invariant, operate with locally defined quantities, be systematic and self-consistent in 
improving the order of approximations, clearly separate the large-scale from small- 
scale matter perturbations, be independent of the mathematical ambiguities intro- 
duced by the averaging procedures, etc. Some steps in developing such a scheme 
were done by Green and Wald [39, 40]. However, their work was focused mainly 
on the discussion of the averaging procedure in cosmology, on the proof that the 
small-scale, post-Newtonian inhomogeneities do not produce a noticeable backre- 
action and on finding mathematical evidences that the Newtonian approximation is 
sufficient in numerical N-body simulations of large scale structure formation [60]. 

No doubt, theoretical questions about how to perform the averaging in cosmology 
and whether it produces any backreaction at all, are important for understanding the 
mathematics of averaging of differential operators in non-linear equations and for 
clarification of the true nature of dark matter and dark energy. However, the post- 
Newtonian approximation scheme in cosmology has broader implications that are 
going beyond the discussion of averaging and backreaction problems and relates to 
the problem of interpretation of precise measurement of cosmological parameters 
by the advanced gravitational wave detector’s technique [61, 62] and formation of 
small-scale structures in the universe at the non-linear regime. The formalism of the 
post-Newtonian approximations can be also helpful in better understanding of the 
influence of cosmological expansion on celestial mechanics of isolated astronomical 
systems like binary pulsars which are currently the best laboratories for testing non- 
linear regime of general relativity [63, 64]. These tests will be made significantly 
more precise with advent of gravitational-wave astronomy and Square Kilometer 
Array (SKA) radio telescope [65]. 

Recently, we have started a systematic investigation of the dynamics of small- 
scale inhomogeneities moving on the FLRW background manifold. We have set up 
a Lagrangian formalism to derive the post-Newtonian field equations for linearised 
cosmological perturbations [48] and analysed the Newtonian limit of these equations 
[66]. The present paper goes beyond the linear regime and explores some non- 
linear effects. In particular, we derive the post-Newtonian hydrodynamic equations 
of motion of the background matter (dark matter and dark energy) along with the 
equations of motion of the baryonic matter forming a small-scale structure with 
high-density contrast like a star, or galaxy or a cluster of galaxies. 

We discuss the concept of the covariant and Lie derivatives on manifold in Sect. 2. 
Geometric theory of Euler-type variational perturbations of arbitrary background 
manifold is set up in Sect. 3. This theory is applied to the FLRW universe, governed 
by dark matter and dark energy, in Sect. 4. Section 5 derives the stress-energy tensors 
for perturbations of the gravitational field, dark matter and dark energy. Finally, 
we derive equations of motion of the small-scale (bare) perturbations in Sect. 6. 
Appendix outlines some particular mathematical aspects of our derivation. 
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Before going into details of our presentation we explain the notations adopted in 
the present paper. 



1.2 Notations 

We use G to denote the universal gravitational constant and c for the ultimate speed 
in Minkowski spacetime. Every time, when there is no confusion about the system 
of units, we use a geometrized system of units where G = c = 1. We put a bar over 
any function that belongs to the background manifold of the FLRW cosmological 
model. Any function without such a bar belongs to the perturbed manifold. The other 
notations used in the present paper are as follows: 

• T and = {X, Y, Z) are the coordinate time and isotropic spatial coordinates on 
the background manifold; 

• X^ = {X^ , X^} = [cT], X^] are the conformal coordinates with r] being a confor- 
mal time; 

m = {ct,x^} is an arbitrary coordinate chart on the background 

manifold; 

• Greek indices a, (3,^, .. . run through values 0, 1, 2, 3, and label spacetime coor- 
dinates; 

• Roman indices i, j,k, . . . take values 1, 2, 3, and label spatial coordinates; 

• Einstein summation rule is applied for repeated (dummy) indices, for example, 
P'^Qa = Qo + P^Qi + P^Q 2 + P^Q 3 , and P’Qi = P^ Qi + P^ Qi + P^ Q3\ 

• Qap is a full metric on the cosmological spacetime manifold; 

• gap is the ELRW metric on the background spacetime manifold; 

• — the metric tensor density of a unit weight -\-l; 

• — the background metric tensor density of a unit weight -\-l; 

• f q ;/3 is the metric on the conformal spacetime manifold; 

• = diag{— 1, + l,-|-l,-|-l}is the Minkowski metric; 

• the scale factor of the ELRW metric is denoted as R = R(T), or SiS a = a (g) = 
R[T(v)]; 

• the Hubble parameter, H = R~^dR/dT\ 

• the conformal Hubble parameter, l~i = a' ja\ 

• T denotes a geometric object on the manifold. It can be either a scalar, or a vector, 
or a tensor field, or a corresponding tensor density; 

• a bar, f above a geometric object T, denotes the unperturbed value of T on the 
background manifold; 

• the tensor indices of geometric objects on the background manifold are raised and 
lowered with the background metric gap, for example J^ap = 

• the tensor indices of geometric objects on the conformal spacetime are raised and 
lowered with the conformal metric tap\ 

• symmetry of a geometric object with respect to two indices is denoted with round 
parenthesis, = (1/2) (^a/3 + ^/3a); 
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• antisymmetry of a geometric object with respect to two indices is denoted with 
square parenthesis, T\ap\ = (1/2) {Tap - Tpa)\ 

• a prime = dT j dr] denotes a total derivative with respect to the conformal 
time 77; 

• a dot T = dJ^ I dT denotes a total derivative with respect to the coordinate time T ; 

• da = d / dx^ di partial derivative with respect to the coordinate 

• a comma with a following index T^a = is an other designation of a partial 

derivative with respect to a coordinate which is more convenient in some 
cases. In some cases which may not cause confusion, the comma as a symbol of 
the partial derivative is omitted. For example, we denote the partial derivatives of 
the perturbations of matter variables as <pa = etc.; 

• a vertical bar, !F\a denotes a covariant derivative of a geometric object !F with with 
connection referred to the background metric Qaf]. Covariant derivatives of scalar 
fields coincide with their partial derivatives; 

• a semicolon, J^-a denotes a covariant derivative of a geometric object with the 
connection referred to the conformal metric fap', 

• a covariant derivative with the connection referred to the full metric, gap is denoted 
with the help of Vq,; 

• — a multiplet of A = {1,2, a} matter fields. In other words, denotes 

a set of fields . . . , where each of the fields ... is a 

tensor density of its own weight mi, m2, . . . respectively. These fields generate 
the full metric of FLRW universe via the Einstein equations; 

• — the background value of the fields . These fields generate the background 

metric of FLRW universe via the unperturbed Einstein equations; 

• — a multiplet of 5 = {1,2, . . . , b} matter fields, ©^ = {©^ ©^, . . . , ©^} 
which can be also tensor densities. They generate the stress energy tensor of the 
bare perturbation of the metric tensor g^jy and that of the fields 

• ©^ — the background value of the fields ©^. 

• — the perturbation of the field Fields and refer to the 

same point on the manifold; 

• = ©^ — ©^ — the perturbation of the field ©^ caused by the counteraction 
of the metric tensor perturbations l^jy and those of the dynamic fields (j)^ on the 
stress-energy tensor of the bare perturbations; 

• H^jy = gfyiy ~ g jyjy — tho motric tensor perturbation. Fields g^^y and g^^y refer to the 
same point on the manifold; 

• — the perturbation of the metric density caused by ©^; 

• In a linear approximation, a^ where 

• the Christoffel symbols, {gvp.-i + gi^j.p ~ gp'j.u ) : 

• the Riemann tensor, R°^pfj,p = ~ ~ 

• the Ricci tensor, Rap = R^a^p \ 

• the Ricci scalar, R = g^^ Rap • 



696 



S.M. Kopeikin and A.N. Petrov 



We shall often employ the term on-shell. By on-shell we mean satisfying the equations 
of motion. For instance, Noether’s theorem links conserved quantities to symmetries 
of the system on- shell. It is invalid off- shell. We shall introduce and explain other 
notations as they appear in the main text of the paper. 



2 Derivatives on the Geometric Manifold 
2.1 Variational Derivative 

Theory of perturbations of physical fields on manifolds rely upon the principle of 
the least action of a functional S called action. Variational derivative arises in the 
problem of finding solutions of the gravitational field equation that extremize the 
action 



where is a scalar density of weight -hi. Let = T {Q, Qa, Qaf]) 

depend on the field variable Q, its first — Qa = Q,a and second — 
partial derivatives that play here a similar role as velocity and acceleration in the 
Lagrangian mechanics of point-like particles. The field variable Q can be a tensor 
field of an arbitrary type with the covariant and/or contravariant indices. For the 
time being, we suppress the tensor indices of Q as it may not lead to a confusion. 
Function T depends on the determinant g of the metric tensor and can also depend 
on its derivatives. We shall discuss this case in the sections that follow. 

A certain care should be taken in choosing the dynamic variables of the Lagrangian 
formalism in case when the variable 2 is a tensor field. For example, if we choose 
a covariant vector field as an independent variable, the corresponding “velocity” 
and “acceleration” variables must be chosen as and respectively. On the 
other hand, if the independent variable is chosen as a contravariant vector A^, the 
corresponding “velocity” and “acceleration” variables must be chosen as A^,c^ and 
A^,q;/ 3 . The same remark is applied to any other tensor field. The reason behind is that 
A^ and A^ are interrelated via the metric tensor, A^ = g^^Ai^. Therefore, derivative 
of A^ differs from that of A^ by an additional term involving the derivative of the 
metric tensor which, if being improperly introduced, can bring about spurious terms 
to the field equations derived from the principle of the least action. 

Variational derivative, 5T ! 5 Q, taken with respect to the variable Q relates a 
change, 5S, in the functional 5 to a change, 5T, in the function that the functional 
depends on. 




( 1 ) 




( 2 ) 



where 
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dQ dQa dQa[3 



^Qa(5' 



(3) 



This is a functional increment of The variational derivative is obtained after we 
single out a total divergence in the right side of (3) by making use of the commutation 
relations, SQa = (SQ)^a and SQa^ = The total divergence is reduced to 

a surface term in the integral (2) which vanishes on the boundary of the volume of 
integration. Thus, the variation of S with respect to Q is given by 



5S = 




(4) 



where 



bT _dT d dT dT 

5Q dQ dx^ dQa dx^dxf^ dQaf]' 

Similar procedure can be applied to S by varying it with respect to and Qa^-l^ 
such a case we get the variational derivatives of ^ with respect to 



bT _ dT d dT 

SQa dQa dxl^ dQa/s' 

and that of with respect to Qap, 

bT _ dT 
^QajS ^QaP 



( 6 ) 



(7) 



Let us assume that there is another geometric object, T (g, Qa, Qa^), which 
differs from the original one T [Q, Qa, Qa(5)^y ^ total divergence 

T (e, Qa. Qap) = T {Q, Qa, Qap) + (Q, Qa) - ( 8 ) 



It is well-known [67, 68] that taking the variational derivative (5) from T and T 
yields the same result 



6T _ 5T 
SQ^SQ^ 



(9) 



because the variational derivative from the divergence is zero identically. In fact, it is 
straightforward to prove a more general result, namely, that the variational derivative 
(5), after it applies to a partial derivative of an arbitrary smooth function, vanishes 
identically 






dT 
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However, this property does not hold for a covariant derivative in the most general 
case [68]. 

The variational derivatives are covariant geometric object that is they do not 
depend on the choice of a particular coordinates on manifold [46, 68]. In case, 
when the dynamic variable Q is not a metric tensor, this statement can be proved by 
taking the first, Qql = Q,a^ and second, Qaf 3 = Q-a(3^ covariant derivatives of Q as 
independent dynamic variables instead of its partial derivatives, Qa and Qa(3- In this 
case the procedure of derivation of variational derivatives (5), (6) remains the same 
and the result is 









dT 

dQa 



+ 



.dQa/3. 



‘,(3a 



( 11 ) 



The order, in which the covariant derivatives are taken, is imposed by the procedure 
of the extracting the total divergence from the variation of the action in (2). The order 
of the derivatives is important because the covariant derivatives do not commute. 

Variational derivative of with respect to the metric tensor g^jj is defined by the 
same Eqs. (5)-(7) where we identify Q = Qa = 9i^iy,a, and = 5^,., a/3- It 
yields 

bT _ dT d dT d^ dT 

dQij^y dx^ dg^y^a dx^dxf^ dg^y^Q,g 

5T _ dT d dT 

^9[My,a dg^xy^a dx^ dg^y^cxf] 

5T _ dT 

^9[iu,a(3 dg^y^ag 



( 12 ) 

(13) 

(14) 



Covariant generalization of (12)-(14) is not quite straightforward because the covari- 
ant derivative of the metric tensor g^y;a = 0. and we cannot use it as a dynamic 
variable. In this case, we consider the set of the metric tensor, g^y, the Christoffel 
symbols T^^y, and the Riemann tensor R^g^y as a set of independent dynamic vari- 
ables. The action is given by (1) where T = a/^/ {g^v, g^y) is a scalar 

density of weight -h 1 . Variation of T is 



5T = 






(15) 



where variations of the Christoffel symbols and the Riemann tensor are tensors that 
can be expressed in terms of the variation 5g^y of the metric tensor [51] 






(16) 

( 17 ) 
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Now, we replace (16), (18) in (15) and single out a total divergence.^ It yields 

(5^=— + (18) 






liu 



where the total divergence vanishes on the boundary of integration of the action, and 
the covariant variational derivative is 



5T 



dT 






flU 






^(Tfl 



dT 



+ 9° 



dT 



dT 



dr 



+ (9^^^,^ +9^'' 



jia 

dT 



dr 



I 



HV / 

dT 



dR^ 



a flu 



dR^ 






dR^ 



y. 






(19) 



Variational derivative with respect to the contravariant metric tensor is 



5T dgaf ST 



Sgai3 



— 9afi9flu 



ST 

Sgai 



( 20 ) 



The variational derivatives are not linear operators. For example, they do not obey 
Leibniz’s rule [69, Sect. 2.3]. More specifically, for any geometric object, H = 
that is a corresponding product of two other geometric objects, T — T {Q^ Qa^ Qap) 
and T = T (^Q, Qa, variational derivative 



6{J^r) ^ 8T ^ 5T 
8Q ^ 5Q ^ 5Q^ 



( 21 ) 



in the most general case. The chain rule with regard to the variational derivative is 
preserved in a limited sense. More specifically, let us consider a geometric object 
(^Q, Qa, Qap) where 2 is a function of a variable P, that is 2 = Q(P). 
Then, the variational derivative 



_ SJ^dQ 

Jp ~ JqJp' 



( 22 ) 



that can be confirmed by inspection [70]. On the other hand, if we have a singled- 
valued function H = 7i(Q), and Q = Q (^P, P^, Pa(5)^ the chain rule 



5H _ dT5Q 

Jp ~ JqJp' 



^The fact that is a scalar density is essential for the transformation of covariant derivatives to the 
total divergence. The total divergences can be converted to surface integrals which vanish on the 
boundary of integration and, hence, can be dropped off the calculations. 
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is also valid. The chain rule (23) will be often used in calculations of the present 
paper. 



2.2 Lie Derivative 



Lie derivative on the manifold can be viewed as being induced by a diffeomorphism 






(24) 



such that a vector field has no self-intersections, thus, defining a congruence of 
curves which provides a natural mapping of the manifold into itself. Lie derivative 
of a geometric object is denoted as It is defined by a standard rule 

(25) 



where T' is calculated by doing its coordinate transformation induced by the change 
of the coordinates (24) with subsequent pulling back the transformed object from the 
point to along the congruence [46]. In particular, for any tensor density 
of type {p,q) and weight m one has 






S ,a ~r inq v\...Vq 

I dOi _1_ _1_ 'Tr^^"'^P dOi 

I j~ a...Vq s ,Z^1 “T • • • “T U\...a s 

_ 'P^^^^^^P dl^l _ . . . _ df^p 

,a ,«■ 



(26) 



We notice that all partial derivatives in the right side of Eq. (26) can be simultaneously 
replaced with the covariant derivatives because the terms containing the Christoffel 
symbols cancel each other. 

The Lie derivative commutes with a partial (but not a covariant) derivative 



da (daJ^) . 



(27) 



This property allows us to prove that a Lie derivative from a geometric object 
^ {Q, Qa, Qap) can be calculated in terms of its variational derivative. Indeed, 

dJ^ dJ~ dJ~ 

^ — £iQai3- (28) 

dQ dQa OQafj 

Now, after using the commutation property (27) and changing the order of partial 
derivatives in £^Qa and £^Qa(3^ one can express (28) as an algebraic sum of the 
variational derivative and a total divergence 
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+ + (29) 

This property of the Lie derivative indicates its close relation to the variational deriv- 
ative on the manifold and will be used in the calculations that follow. It is also 
worth pointing out that (29) is used for derivation of Noether’s theorem of conserva- 
tion of the canonical stress-energy tensor of the field Q in case when ^ £ is the 

Lagrangian density of the field for which the variational derivative vanishes on-shell, 
6J^/5Q = 6C/5Q = 0, and £^C = da 



2.3 Partial Derivatives with Respect to the Metric Tensor 



Calculation of variational derivatives requires calculation of partial derivatives with 
respect to the metric tensor and other geometric objects like the Christoffel symbols, 
the Riemann tensor, etc. An example is the partial derivatives from the determinant 
of the metric tensor 



_ 1 ! — 






(30) 



where g is the determinant of the metric tensor. 

Taking partial derivatives from T with respect to and is facilitated 

with the help of the following formulas 



dgap 

dg^^iy 

5TV 






(31) 

(32) 

(33) 



where we have accounted for the symmetry of the Christoffel symbols and the anti- 
symmetry of the Riemann tensor. 

Variational derivative with respect to the contravariant metric tensor is achieved 
with the help of the derivative 



^ dga(3 5 5 

SgiJ-’' dg!^'' 5gap Sgais' 



(34) 



We will also need to calculate the variational derivative with respect to the density 
of the metric tensor, . It relates to the variational derivative of the metric tensor 
as follows. 
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5q!^^ OqI^^' S gP^ 5 gP^ ’ 

where 

Partial derivatives of any geometric object with respect to either the metric tensor 
or its first, or second derivatives can be easily calculated after making use of 

= (37) 

= (38) 

= 5f5^'^5^^P5l\ (39) 



dg°‘f^ 
dgP'' 
d9°‘^ ,'1 
dgP\a 

dg!^\pa 



(35) 

(36) 



3 Geometric Perturbation Theory of Arbitrary 
Background Manifold 

Let US consider a field theory on a background pseudo-Riemannian manifold M 
having the metric tensor Qc^f] that is a solution of Einstein’s equations with the stress- 
energy tensor of the physical fields where the index A numerates the fields 

and takes values A = 1,2, ... ,a. Let us perturb the manifold by “injecting” into 
it some matter with a bare stress-energy tensor that is generated by a set of 
physical fields where the index B numerates the bare fields and takes values 
B = 1,2, ... ,b. We denote the unperturbed value of the bare fields, ©^ . The tensor 
%a[3 is the source of the bare perturbations of the background values of the metric 
tensor g^^JJ and the fields on the background manifold M. We assume that the 
fields and ©^ are both minimally coupled to the curvature of spacetime in the 
sense of the strong equivalence principle [46, Sect. 3.8.2], [71, Sect. 6.13]. We also 
assume for the sake of simplicity that the fields and ©^ do not directly interact 
one with another. The basic geometric theory of the present paper is Einstein’s general 
relativity. Other geometric theories beyond general relativity can be considered after 
making corresponding modifications in the Lagrangian formalism presented in this 
paper but we shall not proceed in this direction here (see, e.g. [72]). 

The perturbation caused by tensor Tq ,/3 changes the metric to g^g = ga(5 + 
>Coi(5^ and the fields O^toO^ = where and (j)^ are the perturbations of 

the metric and the physical fields respectively. Due to the non-linearity of Einstein’s 
equations the perturbations interact one with another through gravitational coupling 
in Einstein’s equations. It makes the geometric structure of the perturbed manifold 
M rather entangled. This section describes how to find out the perturbed structure 



Equations of Motion in an Expanding Universe 



703 



of the manifold A4, the field equations, and the equations of motion of matter on the 
basis of the Lagrangian variational principle [70, 73]. 



3.1 Variational Principle 

Lagrangian formulation of the dynamic theory of physical perturbations starts off 
the Hilbert-Einstein action defined on an unperturbed (background) manifold M 

S = j (40) 

where C = is a scalar density of weight +1, L = L is the 

Lagrangian depending on the metric tensor, the multiplet of matter (tensor den- 
sity) fields = {<i>^ . . . , and their partial derivatives. We did not show 

explicitly (but are keeping in mind) the dependence of the Lagrangian on the deriv- 
atives to avoid superfluous notations. The tensor fields determine the dynamic 
and geometric structure of the background manifold M via Einstein’s equations. Eor 
short, we shall call the Lagrangian density C simply the Lagrangian. 

The Lagrangian C is split in two parts 

£ (sm- fe-) + ^ (41) 

where the gravitational (Hilbert) Lagrangian 



C^gap)^-^s/^R, (42) 

depends on the background metric tensor g^j^, its first and second derivatives. The 
matter Lagrangian, jC^ depends solely on the metric tensor and (for instance, in case 
of Yang-Mills fields) its first derivatives. Of course, it also depends on the matter 
fields 0 ^ and their derivatives. 

Dynamic equations of the gravitational field and matter are derived from the 
principle of the least action by varying the action (40) and equating its variation to 
zero. This procedure is equivalent to taking the variational derivatives (5) from the 
Lagrangian (41) that yields 



5CP 5C^ 



(43) 



( 44 ) 
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Equation (43) describes dynamic evolution of the matter fields 4>^. Equation (44) 
can be recognized as the Einstein equations for the background gravitational field 
(the metric) after noticing that the variational derivatives 



SC° 



1 - 



(45) 

(46) 



where Rf^i, is the background value of the Ricci tensor calculated with the help of 
the background metric and is the stress-energy tensor of the fields 
Equation (46) is just a definition of the metrical stress-energy tensor of matter [46, 
Sect. 3.9.5]. Equation (45) is usually derived by varying the gravitational action (see, 
for instance, [46, p. 3 10] , [5 1 , p. 364]) and extracting the total derivative that vanishes 
on the boundary of the volume of integration. We emphasize that the variational 
derivative (45) given by Eqs.(19) and (20) achieves the same goal in much more 
simple and attractive way. Indeed, the Lagrangian (42) is equivalent to 






(47) 



This expression depends only on the metric tensor and the Riemann tensor as a 
linear algebraic function. Hence, partial derivatives with respect to the Christoffel 
symbols are automatically nil. Moreover, the covariant derivatives from the metric 
tensor vanish identically. Hence, the variational derivative (20) is reduced to a simple 
partial derivative 



( 5 £° 1 - 

^ D 

167T ^ 



(48) 



that immediately results in (45). Of course, we could calculate the variational deriva- 
tive in (45) by applying a more conventional definition (12). It yields the same result 
but the calculations are lengthy and less obvious. We explain details of this calculation 
in Appendix section “Variational Derivative from the Hilbert Lagrangian”. 

Eor many astrophysical applications the background value of the metric tensor 
gfj^jy is the Minkowski metric which is a trivial solution of the field equations 
(44). However, this assumption is not applicable in cosmology where (44) are the 
dynamic Eriedmann equations describing the expanding universe by means of the 
ELRW metric 



ds^ = -dT^ + + 5ijdX‘dXJ, (49) 

where = (T, X^) are the global coordinates associated with the Hubble flow, 
r = JdijX‘XJ, T is the cosmic time, k is the curvature of space taking one of 
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the three values k = —1, 0, +1, and R(T) is the scale factor which exact time- 
dependence is governed by the solution of Einstein’s equations with the background 
stress-energy tensor determined by the matter fields [4, 5, 51]. 

Physical perturbations of the background manifold A4 are caused by the bare mat- 
ter field 0^ with the Lagrangian where 0^) is a scalar 

function. The present paper assumes that 0^ is minimally coupled with gravity but 
does not interact directly with the fields We postulate that the absolute value 
of is much smaller than the Lagrangian C of the background manifold, that is 
C. The fields 0^ can be conceived, for example, as a matter composing of an 
isolated astronomical system like our solar system or a galaxy, or a cluster of galaxies. 
However, it also admissible to consider 0^ as a seed perturbations of the fields 
for example, in case of discussion of the formation of the large-scale structure of the 
universe from the primordial cosmological perturbations. The assumptions imposed 
on the Lagrangian of the fields 0^ presume that in order to describe the dynamic 
evolution of the perturbed manifold A4 we should add algebraically the Lagrangian 
of the bare perturbations to the unperturbed Lagrangian £ of the background man- 
ifold, write down the perturbed Einstein equations for the metric tensor perturbations 
along with the equations for the perturbations (j)^ of the fields solve them, 
and proceed to the second, third, etc. iterations if necessary. The iterative theory of 
the Lagrangian perturbations of the manifold is described in the following sections. 



3.2 The Lagrangian Perturbations of Dynamic Fields 

Lagrangian formulation of the dynamic theory of physical perturbations of a manifold 
starts off the Hilbert-Einstein action 



where £ = is the scalar density of weight + 1 , and L = 0 ^) is the 

Lagrangian depending on the metric tensor, the matter fields ={0 ^ , . . . , }, 

and the fields 0 ^ = { 0 ^ 0 ^, . . . , 0 ^} representing the bare perturbation of the 
manifold. 

The Lagrangian C consists of three parts 



C 0®) = ( 5 ^^, <5^) + r ( 5 ^^, 0®) , (51) 




(50) 



where the gravitational (Hilbert) Lagrangian 



{gafs) = 



(52) 
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depends on the metric tensor its first and second derivatives. The Lagrangians of 
matter, and depend solely on the metric tensor and its first derivatives. They 
also depend directly on the matter fields and and their partial derivatives 
but we did not show it explicitly to avoid tedious notations. The matter fields 
and ©^ are minimally coupled to gravity but we assume that they are not directly 
coupled to each other. Hence, the Lagrangian of the interaction between these fields 
does not appear explicitly in (51). This assumption can be relaxed in some alternative 
theories of gravity [74]. Such cases can be handled with the formalism of the present 
paper but the computational algebra becomes more intricate and will be considered 
somewhere else. 

It is worth noticing that and depend on the metric tensor both explicitly 
and implicitly through the mathematical definition of the matter fields and ©^. 
For example, consider the Lagrangian of the perfect fluid = p(l -\-Tl) 
where H is the specific internal energy of the fluid and p is the energy density. The 
metric tensor appears explicitly as and implicitly in p that is defined as the ratio 
of the rest energy of the fluid’s element to its comoving volume which depends on 
the determinant of the metric tensor [46] . 

We define perturbations of the gravitational and matter fields residing on the 
background manifold by the following equations. 



where all functions are taken at one and the same point v = of the unperturbed 
manifold. The perturbed values of the fields are assumed to be small compared with 
their background counterparts: |()^^| < and \0^\ < |©^|. There 

are no specific limitations on the rate of change of the perturbations that is on their 
first partial derivatives. The second partial derivatives of the fields are comparable 
(due to the field equations) with the magnitude of the stress-energy tensor, of 
the bare perturbations that is ^ ,ap\ ^ 

We consider the perturbations , (j)^ along with the perturbing field 0^ as a set 
of independent dynamic variables which propagate on the background manifold M 
with the metric g^jj . In order to derive differential equations governing the evolution of 
the perturbations we substitute the field decompositions (53), (54) to the Lagrangian 
C defined by Eq. (51) which yields 

C = C^ (0^^ + A ^ ^ ^ ^ ^ ^ ^ ^ (55) 

Because the perturbations are linearly superimposed with the background 

values of the metric tensor g^jj and the fields O, ©^ respectively, the perturbed (total) 
Lagrangian (56) admits the following property of the variational derivatives, 



Q^-{x) = r^{x) + i)^"(x), 
<t>'^(x) = 4>^(x) + (/)"' (x), 
0®(x) = ©®(x) + 6>^(x), 



(53) 

(54) 

(55) 
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5C _ ^ ^ _ SjC 

5(j)^ ’ 50^ 

These relations allow us to replace the variational derivatives with respect to the 
dynamic perturbation of the field for that taken with respect to the background value 
of the field. It turns out to be useful in calculations that follow. 



3.3 The Lagrangian Series Decomposition 

Let us now expand the total Lagrangian (56) in a Taylor series by making use of the 
variational derivatives of C with respect to and The Taylor expansion of C 
with respect to 0 is not important at this stage of the calculation procedure because 
physical measurements yield access to the total value of the bare perturbation ©. 

We assume at the beginning that the perturbations and their derivatives are small 
enough to ensure the convergence of the expansion. For the Lagrangian is a function 
of several variables, the Taylor series has terms with mixed derivatives starting from 
the second order. There is a nice way of handling the mixed derivatives in the Taylor 
expansion of the Lagrangian by noticing the following property of the commutator 
of two variational derivatives [70] 




where denotes a vector density of weight +1 made of the partial derivatives 
from the background Lagrangian C, and the repeated field label A denotes Einstein’s 
summation over all fields This commutation rule is also valid for any two fields 

from the field multiplet Equation (58) allows us to change the order of the 

variational derivatives to reshuffle the terms with the mixed derivatives in the Taylor 
expansion of the perturbed Lagrangian C. All terms representing the total divergence 
are omitted from the Taylor expansion since the variational derivative from them 
vanishes and they do not contribute to the field equations according to (8), (9). After 
all terms with the mixed derivatives are put in order by applying the commutator rule 
(58) and the total divergences are discarded, the Taylor expansion of the Lagrangian 
takes the following simple form 



C — Cn 



(59) 



Here, is the Lagrangian of the bare perturbation, Cq = Ci^ the Lagrangian (42) 
describing dynamic properties of the background manifold, and for any n>\. 



n \ 



y5Cn- 









O' 



(60) 
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represents a collection of terms of the power n with respect to the perturbations 
and <1)'^. In particular, the linear and quadratic terms of the expansion in (59) read 



^1 



= 



5C 






SC 



Cl 



1 

2 







SCi \ 

S^^J’ 



(61) 

(62) 



and so on. Equation (60) can be proved by induction starting from the value of £i in 
(61) which is apparently true, and operating with the commutation rule (58) in higher 
orders in order to confirm that the result is reduced to the original Taylor series. The 
commutation property (58) of the variational derivatives allows us to write down the 
Taylor expansion (59) as follows 



that establishes a relation between the perturbed and unperturbed Lagrangians in the 
most succinct, exponential form. 

By applying Eq. (57) to the Taylor series (59), and making use of = 

we get an important relation between the variational derivatives of the 
consecutive terms in the series 

These relations can be confirmed directly by making use of (60) that establishes a 
relation between the adjacent orders of the Lagrangian expansion (59). 

The Lagrangian of the bare perturbation can be also expanded in the Taylor series 
with respect to 



where we have defined C® = £^(©^, 0^^). However, in practical calculations it 
is more convenient to hold unexpanded, keeping in mind that at each iteration 
the metric tensor g/^i^ and the field © entering are known up to the order of the 
approximation under consideration. 
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3.4 Dynamic and Effective Lagrangians 



The principle of the least action tells us that the Lagrangian (56) must be stationary 
with respect to variations of the metric tensor and the field variables 



5C 5jC 

= 0, r 



( 66 ) 



We also assume that the background Lagrangian (5 1) is stationary with respect to the 
variations of the background variables and <i>^, and the field equations (43), (44) 
are valid. It means that the variational derivatives with respect to and (j)^ from 
the background Lagrangian Cq = C vanish identically. Therefore, applying Eq. (64) 
to ^ = 1 yields 



5Cx _ SjC 

SjCi _ SjC 
S(j)^ ~ S^^ 



(67) 

( 68 ) 



due to the background field equations (43), (44). 

Equations (67), (68) point out that dynamics of physical perturbations are gov- 
erned solely by the quadratic, cubic and higher-order polynomial terms in the 
Lagrangian decomposition (59). We define the dynamic Lagrangian of the dynamic 
perturbations as follows [70, 73] 



£dyn = £2 + £3 + ■ ■ ■ , 



(69) 



SO that the total Lagrangian (59) can be written down in the following form 



£ = £ + £1 + £'^>'" + £’’. (70) 

The background Lagrangian, £, does not depend on the dynamic variables, 

(j)^ and 0^ . Hence, the variational derivative from iC taken with respect to any of 
these variables is identically zero. On the other hand, the variational derivatives 
from C\ taken with respect to and/or (j)^ vanish on- shell due to the background 
field equations, as shown in (67), (68). Hence, the Lagrangian perturbation theory of 
dynamic fields residing on the background manifold can be built on- shell with the 
effective Lagrangian 

£eff = £dyn ^ (7^^ 

The effective Lagrangian is convenient for deriving the field equations of the physical 
perturbations and equations of motion of matter which are discussed in the rest of 
the present paper. 
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3.5 Field Equations for Gravitational Perturbations 



The Einstein field equations for the metric perturbations are obtained after taking the 
variational derivative (5) from the effective Lagrangian with respect to and 
equating it to zero, 






(72) 



Because of (67), it is equivalent to equation 5 (C — C) — 0 or, after applying 
(57), to (5 (£ — £) = 0. Replacing £ in this equation with expansion (70) and 



accounting for the background Einstein equations, = 0, we recast (72) into 

the following form 



5jCi _ 

6gf^^ ~ 6gf^^ ’ 



(73) 



where we have used (35) in order to replace the variational derivative with respect to 
Qiiiy respect to gf^^. The Euler-Lagrangian equation (73) is convenient 

to work with. It is worth emphasizing that is fully equivalent to the first variational 
equation (66). 

By taking the variational derivatives one can reduce Eq. (73) to a more tractable 
tensor form 

+ C = (74) 



where 



2 



(75) 



is the effective stress-energy tensor and the left side of (74) is a Laplace-Beltrami 
operator for tensor fields on curved manifolds [48] that consists of two parts 






pM 

IIV 



\ 6 tt S 
\ 6 tt S 









) 



(76) 

(77) 



Operator describes perturbation of the Ricci tensor and can be easily calcu- 
lated on any background manifold. Indeed, taking into account (45), we immedi- 
ately get 






( 78 ) 
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Now, according to the rule of rising and lowering indices of the variational derivatives, 
we can recast (78) to 



Variational derivative in (79) is calculated with the help of the co variant definition 
(19) where the covariant derivatives are taken on the background manifold and are 
denoted with a vertical bar. We recall that is an independent dynamic variable 
while the term under the sign of the variational derivative in (79) depends merely on 
the background Riemann tensor. Therefore, the derivative in (79) is taken only only 
with respect to the Riemann tensor in accordance with (19). It yields 



where we have taken into account that the expression enclosed in the brackets, is 
symmetric with respect to indices a and f3. We substitute (80) to (79) and recollect 
definition of along with the constancy of the background metric 

tensor with respect to the covariant derivative. It results in 



where each vertical bar denotes a covariant derivative with respect to the background 
metric and 

Operator describes perturbation of the stress-energy tensor of the matter 
governing the evolution of the background manifold. Hence, it vanishes on any Ricci- 
fiat spacetime manifold in general relativity. Cosmological FLRW spacetime is not 
Ricci fiat. Therefore, makes a non-trivial contribution to the field equations 
for gravitational perturbations. Variational derivative in definition (77) of is 
taken from the Lagrangian, characterizing the background matter fields and 
depends crucially on its particular form which must be specified in each individual 
case. We can bring (77) to a more explicit form by accounting for the definition of 
the metrical stress-energy tensor of the background matter [46] 




(79) 




(80) 




( 81 ) 






( 82 ) 
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and introducing a new function 



;m_ 2 



(83) 



We notice that vanishes on-shell because of the field equation (43). However, 
this equation should not be applied immediately in the definition of as we have 
to take the variational derivative with respect to the metric tensor which is off-shell 
operation. With these remarks Eq. (77) takes on the following form 

C = • (84) 

We shall calculate (84) later on for an ideal fluid and a scalar field in the case of the 
FLRW universe governed by dark matter and dark energy. 

The right side of Eq. (74) contains the effective stress-energy tensor consisting of 
two contributions 



where 



A 






“I" ‘^u 



HV7 



1 bC7 



is the stress-energy tensor of the bare perturbation, and 



^ _ 2 



(85) 

( 86 ) 



(87) 



is the stress-energy tensor associated with the dynamic field perturbations 1) and . 

We notice that tensor is defined as a variational derivative with respect to the 
background metric, Hence, it differs from 

2 bcy 

7’a«=^=— (88) 

which is defined in terms of the variational derivative with respect to the full metric 
g\iv = g^iu + + • • • Relation of the two tensors can be found by making use of 

equations 



dgP^ 




1 ■ 






- \r''9aP 



(89) 



( 90 ) 
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and 



(5 



Qgpa ^ 
dga(3 Qgfiu §gpa • 



§gOil3 



(91) 



It yields an exact relation 




where the trace of the stress energy tensor is defined as T = Relation (92) 

can be inverted leading to 



Tfjiu — ^ ^^a(3 ? (93) 

where T = 

Tensor can be split in two algebraically-independent parts 




(94) 



where is the stress-energy tensor of pure gravitational perturbations while 
r^zy is the stress-energy tensor characterizing gravitational coupling of the matter 
field (j)^ with the gravitational perturbations 

For example, in the second-order approximation, when = C 2 , the corre- 
sponding stress-energy tensors are given by equations 



where is defined in (101). 

As soon as the differential operators and the source terms in Eq. (74) are specified, 
it can be solved by successive iterations. It requires decomposition of the perturba- 
tions in the post-Friedmanian series 



where the terms with indices ^ = 1,2,3,... represent the successive approximations 
of the corresponding order of magnitude. We conjecture that the series (97), (98) 
are analytic and convergent for a sufficiently small magnitude of the perturbations. 
However, this requires a special mathematical study. 





1 J 



167tV^^ 5g^^^ 



_L _L 




(97) 

(98) 
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The iteration procedure starts off the substitution of the unperturbed values of 
= 0 to the right side of (74) and finding the linear perturbation The 
solution is substituted back to the right side of (74), which is solved again to find 
() 2 ^, and so on. However, we need additional set of differential equations for the 
perturbations of the matter fields (j)^ . 



3.6 Field Equations for Matter Perturbations 



Equations for the matter field perturbation (j)^ are derived from the Lagrangian (71) 
by taking the variational derivative with respect to (j)^. Since the Lagrangian 
does not depend on (j)^ it drops out from calculations. Moreover, we assume that the 
background field equations (43) are satisfied. Thus, the stationarity of the Lagrangian 
(70) with respect to the perturbations (j)"^ yields 



^£dyn 



= 0 . 



(99) 



After making use of the second relation in (57), Eq. (99) recasts to 



= SttE^, 



where the linear differential operator 

167T 5 



ttM 

^ A = ■ 



(■ 









) 



and the source density 



= 



2 

<5$^ ■ 



( 100 ) 



( 101 ) 



( 102 ) 



All linear with respect to 1)“^ and terms are included in the left side of Eq. (100) 
while the non-linear terms have been put in S“. More explicit form of the operator 
can be obtained with the help of (82), (83) that results in 



r'M 



Stt S 





(103) 



Lurther specification of requires a particular model of the background matter 
Lagrangian which will be discussed in the following section. 

Equations of motion of the field are obtained after taking the variational 
derivative from the Lagrangian (71) with respect to the variable 0^. Because the 
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only part of the Lagrangian which depends on this filed, is , the equations are 
reduced to 



5C^ 



= 0 . 



(104) 



Particular form of this equation depends on a specific choice of the Lagrangian of the 
bare perturbation. In the lowest order approximation Eqs. (100) and (104) describe 
evolution of the dynamic variables and on the unperturbed cosmological 
background. The next-order approximations take into account the back reaction of 
the background on these fields. 



3.7 Gauge Invariance of the Field Equations 

Gauge invariance of the dynamic perturbations is an important geometric property 
that allows us to distinguish physical degrees of freedom of gravitational and matter 
fields from the spurious modes generated by transformations of the local coordi- 
nates on manifold. Any self-consistent perturbation theory must clearly separate the 
coordinate-dependent effects from physical perturbations which do not depend on 
the choice of coordinates. The gauge transformation is understood in the sense of 
the exponential mapping of the background spacetime to itself that is induced by a 
non-singular vector flow with a tangent vector that is a generator of a 

(finite) coordinate transformation 

= exp + r + + • • • • (105) 

The mapping of any geometric object Q = Q{x^) io another one Q' = Q\x^) 
being induced by transformation (105) defines the gauge transformation of Q that is 
given by the exponential Lie transform 

e'(x“) = (exp£|) 2(x“) = 2(x“) + £^e(x“) + + . . . , (106) 

where the Lie derivative has been defined in Sect. 2.2, = 

£^£^£^, and so on. 

The gauge transformation of the metric tensor, and the matter fields, ©^ , 

implies that their background values do not change under the gauge transformation, 
only the dynamic perturbations, 0^, 0^ change. Hence, the gauge transformation 
(106) applied to these variables read 

(j)'^ =(j)^ + (exp - 1) (<t>^ + , 



(107) 

(108) 
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Let us consider the gauge transformation of the Lagrangian (51) induced by the 
gauge transformations of its arguments in more detail. The transformed Lagrangian 
C' has the same functional form as C that depends now on the new values of 
the dynamic variables, C! = L 0^ + 0'^). Replacing the 

primed variables by making use of Eqs. (107)-(109), yields 

= C l^exp exp + (p^), exp £^ ^0^ + j . (110) 

This equation can be further transformed by making use of the following commuta- 
tion relation [70] 

£[exp£^(g''^ + fl''0,exp£4(4>'^ +0'^),exp£j (©^ +6>®)] (111) 

= exp £^C <t>^ + 0® + , 

that is valid modulo total divergence which is inessential in the variational calculus. 
Expanding the right side of (111) in a Taylor series, like in (106), and taking into 
account that £^C = da yields the gauge transformation of the Lagrangian, 

c = c + dc,{ec) (112) 

+ (C“£)) + [Cd/s (r^))) + . . . , 

where the second, third, and all other terms in the right side of this infinite series 
represent a divergence. The divergence vanishes when one takes the variational deriv- 
ative from it and, hence, it can be omitted from the action functional S given in (50). 
The conclusion is that both the action S and the Lagrangian (56) are gauge invariant 
with respect to the gauge transformation of their arguments. This assertion is valid 
both on- shell and off- shell. 

On the other hand, the effective Lagrangian (71) is gauge-invariant only on- shell 
that is only when the background equations of motion (43), (44) are satisfied. Indeed, 
the effective Lagrangian can be represented as a difference = C — C\— C. After 
making the gauge transformations (107)-(109) of the dynamic variables, we get a 
new effective Lagrangian = C' — £\ — C. The difference is 



= 8C-\- (exp £^ — l) 






5C 









(113) 



where the terms being enclosed in the square brackets, vanish on-shell due to the 
background equations of motion (43), (44). Therefore, £ is a 

total divergence as follows from (112). Hence, is gauge-invariant on-shell. 
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The gauge invariance of the Lagrangian suggests that the Einstein equations (74) 
for metric perturbations are gauge invariant as well. It is straightforward to prove it 
by direct but otherwise tedious calculation which technical details are given in [70]. 
Gauge transformations (107)-(109) applied to the Einstein equations (74) transform 
them as follows 

= a;, + + exp (1 14) 

where function F" vanishes on-shell similar to (113). Therefore, if one assumes that 
the field equations are valid at least in one gauge, the last term in the right side of 
(114) vanishes, thus, proving that the field equations are valid in any other gauge. 



4 The Lagrangian Formalism in Cosmology 



We shall use the cosmological model that is in a close agreement with modem 
observational data. In this model the background manifold represents the spatially 
homogeneous and isotropic ELRW universe which temporal evolution is governed 
by a doublet of matter fields, We identify 0^ = 0 with the Clebsch 

potential of an ideal fluid with pressure, and = 4^ — with a scalar field having the 
potential W = W(4^) depending only on the scalar field 4^. The ideal fluid models 
a self-interacting dark matter [75] while the scalar field describes dark energy in the 
form of quintessence [76]. The dark matter without self-interaction is included in 
our theoretical scheme as a pressureless ideal fluid. 

The overall Lagrangian of the model under consideration is given by Eq. (51) with 
the Lagrangian of the matter consisting of two non-directly interacting pieces 

C^ = C^ + C^, (115) 

where is the Lagrangian of dark matter, and is the Lagrangian of dark energy. 



4.1 Lagrangian of Dark Matter 

Dark matter is modelled as an ideal fluid that is characterized by four thermodynamic 
parameters: the rest-mass density pm, the specific internal energy per unit mass flm, 
pressure pm, and entropy where the sub-index ‘m’ stands for the dark ‘matter’ . We 
shall assume that the entropy of the ideal fluid remains constant (isentropic motion) 
and dissipative processes are neglected. The total energy density of the fluid 

em = Pm(l + nm). ( 116 ) 



One more thermodynamic parameter is the specific enthalpy of the fluid defined as 
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Mm — 



H“ Pm 
Pm 



1 + 0^+—. 

Pm 



(117) 



We shall consider barotropic fluid which thermodynamic equation of state is given 
by equation pm = Pmipm, Hm), where the speciflc internal energy Om is related to 
pressure by the first law of thermodynamics 



r/rim + Pmd 




= 0 . 



It is used to derive the following thermodynamic relationships 



(118) 



dpm — pm^Pm^ (119) 

d^m — Mm^Pm? (120) 

which are derived directly from (116)-(118). Equation (119) immediately tells us 
that the partial derivatives 



dpm 

dpm 

dpm 



= Pm, 
— Pm- 



(121a) 

(121b) 



Equation (121) elucidate that all thermodynamic quantities are functions of only one 
thermodynamic potential. We accept that this potential is the speciflc enthalpy pm- 
Equation of state, relating pressure and the energy density, becomes pm = Pm(^m), 
and it is also an implicit function of the same thermodynamic variable p^ because 

^m = ^m(Pm)- 

Partial derivatives of the thermodynamic quantities with respect to p^n can be 
calculated by making use of (119), (120), and the equation of state pm = Pmi^m) 
giving rise to definition of the (adiabatic) speed of sound propagating in the fluid 



dpm _ ^ 

d€m ’ 



( 122 ) 



where the partial derivative is taken under a condition that the entropy, Sm, does 
not change. Notice that the speed of sound in dark matter is not constant in the most 
general case of a non-linear equation of state. In this case, the speed of sound depends 
on the thermodynamic potential pm through the equation of state, that is = Cs(ps)- 
It is also worth emphasizing that Eq. (122) is valid for any wavelength of sound wave 
in the ideal fluid, not only for short wavelengths. 

Other partial derivatives of the thermodynamic quantities can be calculated with 
the help of the equation of state and derivatives (121), (122) which can be inverted, if 
necessary, because all thermodynamic relations in the ideal fluid are single-valued. 
We have, for example. 
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C dpx^ C /9m 

^ ^ 9 (123) 

where all partial derivatives are performed under the same condition of the constant 
entropy. 

Theoretical description of the ideal fluid as a dynamic system on space-time 
manifold is given the most conveniently in terms of the Clebsch potential, O which 
is also known as the velocity or the Taub potential [77]. The Clebsch potential is 
a continuous scalar field which can be taken as an independent dynamic variable 
characterizing the motion of the fluid. This description is complimentary (dual) to 
the Lagrangian formalism of the ideal fluid based on the coordinates and four- velocity 
of the fluid particles. 

In the case of a single-component fluid the Clebsch potential is introduced by the 
following relationship 

Pm^a — (124) 

where is the four- velocity of the fluid, = ga(3^^^ is the proper 

time of the fluid element taken along its world line, and we denote Oq, = O q, from 
now on. Equation (124) solves the relativistic Euler equation of motion of the ideal 
fluid [78]. The four- velocity is normalized, = —1, so that the specific 

enthalpy can be expressed in the following form 

Aim = (125) 

One may also notice that 

gLm = (126) 

The Clebsch potential O has no direct physical meaning as it can be changed to 
another value O ^ O' = O -h O such that the gauge function, O, is constant along 
the worldlines of the fluid in the sense that = 0- 

The Lagrangian of the ideal fluid is usually taken in the form of the total energy 
density, = ^/^Cm [79]. However, this form is less convenient for applying 
the Lagrangian formalism on manifolds as it does not contain the kinetic energy 
(~^«/^Oc^O/ 3 ) of dynamic field variables directly. Eor this reason, the Lagrangian 
of the ideal fluid in the form of pressure, is used in the present 

paper. It differs from the Lagrangian in the form of energy by a total divergence 
(see [46, pp. 334-335] for more detail) which is unimportant as it vanishes when the 
variational derivative from the Lagrangian is taken. The Lagrangian in the form of 
pressure includes the kinetic energy term 



= y^(em 



PmMm) — 







) 



(127) 
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Metrical stress-energy tensor of the ideal fluid is obtained by taking a variational 
derivative of the Lagrangian (127) with respect to the metric tensor, 



T-’in 

''a/3 — 



2 50 ^ 



(128) 



In our field-theoretical approach to the description of the ideal fluid the metric tensor 
enters all thermodynamic quantities only through the specific enthalpy in the form of 
(125). Therefore, taking the variational derivative in (128) with respect to the metric 
tensor can be done with the help of the chain rule (23) and the results are given in 
Appendix section “Variational Derivatives of Dark Matter Variables”. Finally, the 
stress-energy tensor (128) is 



Tag - (£m + Pm) WaWg + PmPap, (129) 

that is a standard form of the stress-energy tensor of the ideal fluid [79]. 



4.2 Lagrangian of Dark Energy 



The Lagrangian of dark energy is taken in the form of a quintessence of a scalar 



fields 



(130) 



where W = IT (^) is a potential of the scalar field and we denote from 

now on. We assume that there is no direct coupling between the Lagrangian of dark 
energy and that of dark matter. They interact only indirectly through the gravitational 
field. Many various forms of the potential IT are used in cosmology [80] but at the 
present paper we do not need to specify it further. 

The metrical stress-energy tensor of the scalar field is obtained by taking a varia- 
tional derivative 







2 6/:^ 



7=5 <^ 5 “^ ’ 



(131) 



that yields 



‘-a(3 ' 



- 9aP 






(132) 



One cmiformally reduce tensor ( 132) to the form similar to that of the ideal fluid by 
making use of the following procedure. First, we define the analogue of the specific 
enthalpy of the scalar field “fluid” 



Pq = 



(133) 
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and the effective four- velocity, of the “fluid” 

flqVa = (134) 

The four- velocity i)“ is normalized to gapv'^v^ = —1. Therefore, the scalar field 
enthalpy /iq can be expressed in terms of the partial derivative from the scalar field 

/iq = (135) 

We introduce the analogue of the rest mass density pq of the scalar field “fluid” by 
defining, 

/9q = Mq = (136) 

As a consequence of the above definitions, the energy density, 6q and pressure pq of 
the scalar field “fluid” can be introduced as follows 

We notice that relation 

^q + /^q 

H ’ 

Pq 

between the specific enthalpy pq, density pq, pressure pq and the energy density, Cq, 
of the scalar field “fluid” formally holds on the same form (1 17) as in the case of the 
barotropic ideal fluid. 

After substituting the above-given definitions of various “thermodynamic” quan- 
tities into Eq. (132), it formally reduces to the stress-energy tensor of an ideal “fluid” 

^a/3 = (^q + + P^Pag- (140) 

It is worth emphasizing that the analogy between the stress-energy tensor (140) of the 
scalar field “fluid” with that of the barotropic ideal fluid (129) is rather formal since 
the scalar field, in the most general case, does not satisfy all required thermodynamic 
equations because of the presence of the potential W = W(4^) in the energy density 
Cq, and pressure pq of the scalar field. The dark energy is physically different from 
dark matter! 



-pqPq + W(4^), 
- ^(^)- 



(137) 

(138) 
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4.3 Lagrangian of the Bare Perturbation 

The Lagrangian of bare perturbation that appears in (51), can be chosen arbitrary. 
We assume that the matter of the bare perturbation is described by a variable © 
which may be a scalar or tensor field. The Lagrangian of the bare perturbation is 
given by (© , . Stress-energy tensor of the bare perturbation, 

is defined in ( 88 ). Tensor is a source of the bare gravitational perturbation 
of the background manifold which generates the large-scale structure of the universe. 
Its structure should be specified depending on a particular physical situation. In this 
paper we focus on derivation of hydrodynamic equations of motion of an ideal fluid on 
background manifold with taking into account the manifold back reaction. Therefore, 
we assume 



where e, p are the energy density and pressure of the fluid comprising the bare 
perturbation, and is its four- velocity normalized to UaU^ = —1. It is worth 
emphasizing that the four- velocity of the bare perturbation is not a part of the back- 
ground fluid flow and is decoupled from it in the first approximation. Notice that 
the stress-energy tensor defined in (85), is not equal to We have derived 
relation between the two tensors in (92) and (93). 



4.4 Background Manifold 

All geometric objects on background cosmological manifold M will be denoted 
with a bar over the object. The FLRW metric on the background manifold is given 
in (49). It is convenient to introduce global isotropic coordinates = (X^, X^) by 
changing the cosmic time T to the conformal time rj = X^ via differential equation 
dT = a(r])dr]. The FLRW metric tensor in the isotropic coordinates reads 



where a cosmological scale factor, a(r]) = R[T(r])]. The conformal metric = 
(— 1 , fij), where the spatial part of the metric 



depends on the curvature of the spatial hypersurfaces, ^ = {—1, 0, -hi}. In case, k = 
0, the conformal metric f is reduced to the Minkowski metric, 77 ^^^. Four- velocity 
of the Hubble flow in the isotropic coordinates is = dX^jdT = (< 3 “^ , 0, 0, 0, 0). 

Due to the maximal symmetry of FLRW spacetime, all background geometric 
objects depend only on time p in the isotropic coordinates. We can, and will use 



Ta[3 — + P) UaUfj -h ppap, 



(141) 



P/J.U ^ ('^)f//Z^9 



(142) 




(143) 
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arbitrary coordinates = (x^, x^ ) which are connected to the isotropic coordinates 
by a smooth diffeomorphism x°^ = x^(X^). Partial derivative of a background 
geometric object, Q = 2 ( 77 ), in the arbitrary coordinates is given by 



Q' 

Q,a = Ua = -QUa, (144) 

a 

where is four- velocity of the Hubble flow in arbitrary coordinates. Equation (144) 
applied to the scale factor, yields a^a = = —Hua^ and the partial derivative 

from the conformal Hubble parameter = —'Hua- 

Einstein’s field equations on the background manifold are given by (44)-(46) 
which yield two Eriedmann equations for the temporal evolution of the scale factor a. 



2H + 3H^ 



87T _ k 

3 ’ 


(145) 


r. ^ 

-87TP - ^ 


(146) 



a 



where e and p are the effective energy density and pressure of the background matter. 
A consequence of the Eriedmann equations (145), (146) is an equation 



H = -47t (e -h p) + 




(147) 



relating the time derivative of the Hubble parameter with the sum of the overall 
energy density and pressure of dark matter and dark energy 



e + p = Pm/im + PqPq. (148) 

The equation of continuity for the rest mass density pm of the background dark 
matter is (43) for ^ <i>. It reads 

(149) 

that is equivalent to 

Pm\a ~ — 0- (150) 

The background equation of the conservation of energy of dark matter is 

^m|o; “ (^m H” Pm) = 0, (151) 

where we have employed the definition of energy (1 16), and Eqs. (118), (150). 
Background equation for the scalar field 'T is (43) for ^ It reads 
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After making use of the definition of the background enthalpy of the scalar field 
/iq = an equality /iq = pq, and definition (137) of the specific energy eq of 

the scalar field, the Eq. (152) can be recast to 

eq|a - (eq + ^q) Ma = 0, (153) 

that is similar to the hydrodynamic equation (150) of the conservation of energy of 
dark matter. Because of this similarity, the second Friedmann equation (146) is not 
independent and can be derived directly from the first Friedmann equation (145) by 
taking a time derivative and applying the energy conservation Eqs. (151) and (153). 

The equation of continuity for the density pq of dark energy is obtained by dif- 
ferentiating definition (136) of pq, and making use of (152). It yields 



or, equivalently. 






dW 

1 ^’ 



Pq\a ^HpqUa 



dW . 



(154) 



(155) 



which shows that the density pq is not conserved. There is no violation of physical 
laws here since (155) is simply another way of writing (152) and dark energy is not 
thermodynamically equivalent to dark matter. 



4.5 Perturbations of Dynamic Variables 



In the present paper, FERW background manifold is defined by the metric which 
dynamics is governed by the two fields — the Clebsch potential O of dark matter 
and the scalar field 4^ of dark energy. We assume that the background metric and 
the fields are perturbed by the presence of a bare matter field 0. Perturbations are 
considered to be sufficiently small so that the perturbed metric and the matter fields 
can be split in their background values and the corresponding perturbations, 

dap = 9ap + ^aP^ 0 = 0 + 0 , 0 = 0 + 0 . ( 156 ) 

These equations are exact. Perturbation of the contravariant component of the metric 
is not independent and is determined from the isomorphism 
and is given by 



gal3 ^ -a/3 _ ^a/3 ^ ^a^^7/3 + . . . ^ 



(157) 
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where the ellipses denote terms of the higher order. 

We consider perturbation of the metric — that of the potential of dark matter — 
0, and that of the potential of dark energy — as weak with respect to their corre- 
sponding background values ga[ 3 , and 'E, which dynamics is governed by equa- 
tions that have been explained in Sect. 4.4. Because the field variable 0 is the source 
of the bare perturbation, we postulate that its background value is equal to zero. The 
perturbations Xap, 4>, and ip have the same order of magnitude as 0. More convenient 
dynamic variable of gravitational field is a contravariant (Gothic) metric 

(158) 

The covariant Gothic metric is defined by that yields Qap = 

gapls/^- The Gothic metric is expanded around its background value, = 
as shown in (53). Calculations also prompt to single out from 
and operate with a variable 

ua(5 

= ~^=. (159) 

v-^ 

Tensor indices of the metric tensor perturbations, etc., are raised and lowered 

with the help of the background metric, for example, l^p = gafiggiyl^^- The field 
variable relates to the perturbation of the metric tensor as follows 



(160) 

where k = ^ = g^^ and ellipses denote terms of higher orders in 

Perturbations of four- velocities, and entering definitions of the stress- 
energy tensors (129), (140), are fully determined by the perturbation of the metric 
and those of the potentials of dark matter and dark energy. Indeed, according to 
definitions (124) and (136) the four- velocities are defined by the following equations 



Wa 




Va 




(161) 



where /im = and /iq = in accordance with (125) 

and (133) respectively. We define perturbation of the covariant components of the 
four- velocities as follows 



Woi = Ua -\- (162) 

where the unperturbed values of the four-velocities coincide and are equal to the 
four- velocity of the Hubble flow due to the requirement of the homogeneity and 
isotropy of the background FLRW spacetime that is = u^. Hence, we have 
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P^q 



(163) 



Making use of (161) and (163) in the left side of definitions (161), and expanding its 
right side by making use of expansions (156) and (157), yield 



SWa = - lq«c 

Mm 2 



SVa = - l:C\Ua, (164) 

Mq 2 



where Pa (3 = Qa^ + Wq. W /3 is projector onto the hypersurface orthogonal to the Hubble 
flow, and 

q = + -, (165) 

is projection of the metric tensor perturbation on the Hubble flow. 



4.6 Field Equations 

4.6.1 Gravitational Field 

Field equations for metric tensor perturbation are given by the Euler-Lagrange equa- 
tions (74) where the operator 



C = Kv + (166) 

consists of two pieces corresponding to dark matter (index ‘m’) and dark energy 
(index ‘q’). These pieces are defined in accordance with (77) or, more exactly. 



= 

^ fiv — 



= 



167T J 
16tt S 





(167) 


T" - 1 • 
SqP'^ 6^ ) 


(168) 



Taking variational derivatives from various functions entering the Lagrangians 
and is rather straightforward and follows from their definitions, the chain rule (23), 

and a set of variational derivatives of thermodynamic quantities given in Appendix 
section “Variational Derivatives of Dynamic Variables with Respect to the Metric 
Tensor”. Making use of the Lagrangians (127) and (130) taken on the background 
manifold and calculating variational derivatives in (167), (168), we obtain 
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= -47r(;>m - em)V + STTyOm {un(j)u + Uv(j>ij, - guvU^^’a) 



/ 


C^\ / 1 \ 


+ STT/Om 1 


1 - ^ j (pa- ^PmC\ j 



= -47T {pq - £q) l^,y + 87Tpq “ §txuU°‘'tpa) 



dW , 



(169) 



(170) 



where pq = /iq = in accordance with definition (136) projected on the back- 
ground manifold. The dark energy potential function, W = IT('T), is arbitrary. 

Substituting (169), (170) along with (81) to the left side of (74) yields field equa- 
tions for gravitational perturbations in a covariant form [48, Eq. 161] 

+ + 167tA^,, (171) 

where is the gauge vector function. They can be drastically simplified by 

choosing the gauge condition imposed on the variable in the following 

form 



A^ = —IHl^^Up -h 167T (pm0 + PqV^) (172) 

This gauge condition cancels a significant number of terms in the field equations (171) 
and allows us to decouple equations for different components of . Picking up the 
isotropic coordinates of the Hubble observers we bring the gravity field equations to 
the following form [48] 



-h 2Hq^o -h 4kq — 47t 







87t (Aoo + ^kk) (173a) 




- 16™^ 

+ 32™// [pm4> + Pq'lp) , 



□/oi + IHloifi + 2kloi = 167rAo,-, (173b) 

+ 2{H - k) = 167tA<,-,>, (173c) 

□/ + 2///,o + 2(//+2qZ = IbTrAj:*:. (173d) 



where we denoted the wave operators \3q = and Other 

notations are 4>q = </>,o, q = (ho + kk) /2, I = kk = hi + hi + h^, l<ij> = 
lij — and the same index notations are applied to the effective stress- 

energy tensor 
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4.6.2 Dark Matter 



Evolution of dark matter perturbation is described by the perturbation of the Clebsch 
potential (j). Equation for 0 is derived from a general equation (100) is, in case of 
dark matter, reads 






(174) 



All terms in this equation can now be explicitly written down because the Lagrangian 
of dark matter is fully determined by (127). The linear differential operator is 
derived from (103) which yields 



^ . (175) 



where 



The source density 



2 



(176) 



(177) 



and we shall calculate it explicitly in next section. 

Calculation of variational derivative in (175) requires taking variational deriva- 
tive from the density pm of the ideal fluid with respect to the background value of 
the Clebsch potential O. The density implicitly depends on the potential through 
the specific enthalpy, pm = p (Pm) which depends only on the derivatives Oq, of the 
potential. Hence, 



^Pm d dpm ^ \ 

dx^ di>a dx^ \dpni di>a) 



(178) 



where the partial derivative of the density with respect to the specific enthalpy is 
calculated with the help of (123) by making use of equation of state of the ideal fluid, 
and the partial derivative 



dpm -a 
= u 

d^a 



(179) 



It reveals that 

F^ = (180) 



where the vector field 

^ + (l - ^) ( - Ipmq) m". 

Mm V t’s/ \Mm 2 / 



( 181 ) 
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Taking covariant derivative in (180) brings about the field equations for (j) 

(182) 

+ ^1 - (u''uf^4>alS - 

(u'^cPc, - ^Mmq) = S'", 

where the very last term accounts for the fact that the speed of sound is not constant in 
inhomogeneous medium [78]. Indeed, the speed of sound, c§, relates to other thermo- 
dynamic quantities by equation of state making the speed of sound a function of the 
specific enthalpy. Hence, = Cs(/im)- Covariant derivative from the speed of sound 
is Cs|a = (dcs/dflm) yfimicf, where the covariant derivative yfimicf = (dfim/dpm) Pm\a 
and, according to equation of continuity, pjn\a = yields 

/ c^\ dlnc^ 

1-^ (183) 

V / Icf Cfpm 

that explains the origin of the last term in (182). 

After imposing the gauge condition (172), Eq. (182) is reduced to 

+ 167T/ini (pm0 + Pqi^) ~ 2/im^q (184) 





4.6.3 Dark Energy 

Calculation of the field equation for dark energy perturbation follows similar path 
like in previous subsection. The field equations follows from (100), and they are 

= 87tE^, (185) 

where and are determined by the Lagrangian of dark energy (130). Making 
use of this Lagrangian in (103) we obtain 

^ A q + , (186) 



where 
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The source density 



i‘^ = 2 



( pn M ) I “1“ — - 

’\» Q\li 



S^l = 



2 



(187) 



(188) 



and we shall calculate it explicitly in next section. 

According to Eq. (130), the Lagrangian density of the scalar field depends on 
both the field ^ and its first derivative, For this reason, the differential operator 
is not reduced to the covariant derivative from a vector field as the partial derivative 
of the Lagrangian with respect to ^ does not vanish. We have 



= Stt 




I dW d^W\ 

~ ) 



(189) 



where I = g'^^lap, and vector field 



Z“ = - pal'^^up, (190) 

where we have used equation 'I'|c = —u^'^\pua = —pqUa- Taking covariant deriv- 
ative in (189) yields the field equations for ip 

( dW\ d^W 
where Eq. (172) has been used along with the equality pq = /iq. 



5 Stress-Energy Tensor of Dynamic Perturbations 



The effective stress-energy tensor is defined as a variational derivative (75) taken 
from the effective Lagrangian (71). According to (85), it consists of two parts — the 
stress-energy tensor of matter of the bare perturbation 1^^^, and the stress-energy 
tensor of dynamic perturbations T^jp. We shall keep tensor unspecified as long 
as theory permits and focus on calculation of which also consists of two parts, 
and according to (287). Tensor is the stress-energy tensor of gravitational 

field perturbations (95). Tensor is the stress-energy tensor due to the coupling 
of matter perturbations on the background manifold and the gravitational field per- 
turbations. General formula for calculating is given in (96). In case of FLRW 
universe gowerved by dark matter and dark energy, tensor is linearly split in two 
counterparts 
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(192) 



where and describe contributions of dark matter and dark energy respectively. 



5.1 Stress-Energy Tensor of Gravitational Field Perturbations 

Stress-energy tensor of gravitational perturbations, has a universal presentation 
on any manifold. We begin its calculation from definition (95) and notice that vari- 
ational derivative is insensitive to terms which are total divergences. Hence, it is 
reasonable to single out a total divergence in the argument of variational derivative, 
jrG = entering definition (95) of the gravitational stress-energy 

tensor. We substitute (81) for to calculate and, then, employ the Leibniz rule 
to single out the total divergence from the products of two functions. We arrive to 

^ - if)'’"' V ia + '7|A + div, (193) 

where , () = gpcrY^, I = and div is a total divergence that 

can be discarded from the calculation of the variational derivative. 

Next step is to apply the covariant definition (19) of variational derivative to (193) 
in definition (95) of iag. After accounting for transformation (34), definition (95) 
can be written as follows 



leniag = 



I ^ ^ 

I — =Q(y,ii9 13y r- — 9 

\~9 og^iy 



(194) 



that conforms with the lower position of indices in variational derivative (19). It is 
worthwhile to remind the reader that we consider perturbation as independent 
variable which has been used in derivation of (95). It means that the partial derivative 



dg^y 



(195) 



The covariant components of the gravitational perturbation, con- 

tain the background metric tensor and cannot be considered as independent from the 
background metric tensor. We have 



d^af] 

dg^y 



-w^ggx^ + 

^9fiy 



dgpx 

dQfiu 









(196) 
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Let us consider now the dependence of the covariant derivative on the metric 
tensor. We notice that the perturbation is a tensor density of weight — 1 . Therefore, 

its covariant derivative has one more term as compared with that of a tensor of a 
second rank. More specifically, 

(197) 



It reveals that the derivative depends merely directly on the Christoffel symbols and 
is independent of the metric tensor Hence, the partial derivative 

= 0. (198) 

It confirms that the metric tensor and the Christoffel symbols are true independent 
variables along with the tensor density and its covariant derivative 
Equation (193) given in terms of the independent variables, reads 

^ , (199) 

where we have discarded the total divergence. 

Variational derivative (19) from (193) engages the partial derivatives with respect 
to the background metric tensor and those with respect to the Christoffel symbols. 
Partial derivative with respect to the metric tensor yields 

Partial derivative with respect to the Christoffel symbols taken from is calcu- 
lated with the help of (197). It gives 

Pi per 

( 201 ) 

After making use of this formula, the partial derivative of with respect to the 
Christoffel symbols results in 




1 

\/~9 



= + 2m\p\a) - 2lpJ^^P^P 



( 202 ) 
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It allows us to calculate the linear combination of the partial derivatives entering 
definition of variational derivative (19), 






dJ^ 














V'l 



dr<^ 



(203) 



flp / 



= 2 / — I 

p p p ^ 



After replacing (200) and (203) into variational derivative (194) defined by the rule 
(19), the stress-energy tensor of gravitational field takes on the following form 



167tV = (204) 

+ lpcr\plp^'^ + —lp(j\pl^‘^\u + ~ '^hphi' 

+9pu 

+ - i.pp^ip - ipppp\u - P%.\p - ■ 

It apparently depends on the second derivatives of the gravitational perturbation, 
Significant number of the second derivatives can be eliminated on-shell by making 
use of the co variant field equation (171). We take the covariant derivative (204) 
and express the commutator of the second-order derivatives from the metric tensor 
perturbation in terms of the Riemann tensor 



PpWlS = Pp\^a - PpR^jali + P^R\a!3- (205) 

A useful consequence of this equation is 

l^p\aa = Apia + l^pRaa + (206) 

where Straightforward but tedious rearrangement of the second-order 

derivatives from the metric tensor perturbations with the help of (205), (206) allows 
us to put (204) into the following form 
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167tV = - l^'^\Jpp\a (207) 

“t“ \u lpv\pa 

+ \gp. + \i%) 

H”2/^(^A^)|p — l^yA^\p — lpy\pA^ 

+ 87t [4/'>(^©^)p - V© - - g ^, { lf>^@pa - ^©^] 

+ '^1^ pl^ lyRpa + {pRu)af3p 

where 

©a/3 = ^«/5-^(C/3 + ^«/3)- (208) 

and the spur, 0 = Notice that the on- shell form (207) of the stress-energy 

tensor ipj^ for gravitational field depends on the choice of the gauge function A^ as 
well as on the value of the Riemann tensor of the background manifold. 



5.2 Stress-Energy Tensor of Dark Matter Perturbations 

The part of the stress-energy tensor describing the dark matter perturbation is given 
in (192) by that, according to (96), is calculated as a variational derivative 

1 

= ~^=gapgpiyj^, (209) 

v~g ^gpiy 

where 

^ ^papp^ _ 1 ^ (21Q) 

Individual terms entering the right side of (210) are taken from (169) and (180). We 
single out the total divergence and brings (210) to the following form 

jrm ^ ^^papm _ ^ ( 21 1 ) 

where the total divergence has been dropped off, (j)a = (t)\a^ and is given in (181). 
After reducing similar terms Eq. (211) takes on the following form 
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jr™ = ~ 3pm/"^Ma0/3 j (212) 

-AiTyf^giPr^ - em) 

- I" ■ 

Mm V [ 2 2 J 



where again we have used notation (pa = P\a- 

Taking variational derivative in (209) is rather straightforward but tedious proce- 
dure. Because the Lagrangian depends neither on the Christoffel symbols nor on 
the curvature tensor, the variational derivative (209) is reduced to a partial derivative 
with respect to the metric tensor / dg^jj. Calculation of the partial 

derivative is done with the help of the chain rule and equations in Appendix section 
“Variational Derivatives”. It yields the stress-energy tensor of dark matter 



' !iv 






pm 
2//m 






) 



d In c, 



X 1^ («“?!'«) 



pm Pm 



cl dprn _ 



^2 3 



pm 

4/in 



-pmOiU Pa + 
,2 



(213) 



1^0"^ 0a + ^1 — (w^0a) j : 

— pmPm ^1 “ ^ Qiii' T ~ 2 ^ 



~ 2^^ m ^m) I 



where we have used (1 17) to make a replacement e^-\- p^ = pmPm- 



5.3 Stress-Energy Tensor of Dark Energy Perturbations 

The part of the stress-energy tensor describing the dark energy perturbation is given 
in (192) by that, according to (96), is calculated as a variational derivative 



736 



S.M. Kopeikin and A.N. Petrov 



q 1 

(214) 

where 

^ (215) 

Individual terms entering the right side of (215) are taken from (170) and (189). We 
single out the total divergence and bring (215) to the following form 

^ - 87ry=^^/i„Z“ - 47t^4> - (216) 

where the total divergence has been dropped off. More explicitly, 



3, dW 



+ -llp^+tp^ - 
2 d4i 



= — Stt^^ 

F^T^yThWi^) - , 



d'^W 



- 3Hql°'^Ua'tl’l3 



(217) 



where Taking variational derivative with respect to g^jy, we obtain the 

stress-energy tensor of dark energy perturbation 






1 1/ od^W\ _ 3 dW 

- - y^p°‘^pa + -tp j 5^12 - 4 

+ W('i') i 9,.u 



(218) 



6 Post-Newtonian Equations of Motion in Cosmology 
6.1 General Formulation 

Let us consider a general manifold with the effective Lagrangian 

(^A qB^ depending on five independent variables and their derivatives which 
we intentionally omitted from the argument of the Lagrangian to avoid a cumber- 
some notation. We have proved in Sect. 3.7 that the effective Lagrangian is 
gauge-invariant modulo a total divergence. The gauge invariance of suggests 
that its Lie derivative must be also nil modulo a total divergence of a vector field: 
= da Because the total divergences do not affect field equations we drop 
all of them out of the subsequent equations. 

We compute the Lie derivative of the effective Lagrangian by making use of (29) 
that reduce calculation of the Lie derivative to that of variational derivatives modulo 
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a total divergence. After dropping off the divergence, we have 















r/’eff 

(5d)A « ^ 



5g°‘^ 



6(j)^ 



(219) 



Field equations (72), (99), (104) describing evolution of perturbations 0^, 0^ on 

the background manifold exterminate the last three terms in the right side of (219). 
The first term in the right side of (219) can be written down as follows 

x/’eff 

( 220 ) 

where we have used definitions (75), (85)-(87), and equation 

The Lie derivative of the field depends on its geometric properties. In a 
particular case of tensor density the Lie derivative is given by 

(26) that can be written symbolically as follows 

£^^^ = (221) 



where 



'ui...Uq\a’ ajS 



= K^^9a9, 



and 



= m6t 



V / 



-c 




S^l£2...l£p 


■■-C 


i^T 


...I£p-l 


V y 






V / 1^1 


...^q 




(i.; 








.{Ip 




OlV2..Mq 






■ l^q-ia 



( 222 ) 



Making use of definition (102) and (222) we can present the second term in the right 
side of (219) in the following form 












Substituting (220), (223) to the right side of (219) results in 



£^£^^ = (^-Aap + l^K^p) r"". 



(223) 



(224) 



Applying the Leibniz rule to change the order of differentiation in the terms depending 
on , we can recast (219) to the following form 
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where the vector field 






aJ^ - 1 r + y=^lT/3'“, (225) 






(226) 



The last term in (225) is reduced to the total divergence of a vector density 



^ , (227) 

where Wa • Therefore, the variational derivative of the effective Lagrangian 

can be equal to zero modulo the divergence of the vector field if, and only if, the 
combination of terms enclosed to the square brackets in (225) is nil. It yields the 
equations of motion of the matter of bare perturbation 

Aa/?"" = 5 ■ (228) 

It should be compared with the law of conservation of matter in fiat background 
spacetime where it has a similar form Aq./ 3 ’^ = 0 of the total divergence [79]. The 
presence of the background matter fields makes the right side of (228) different 
from zero. This result was established in [73]. 

Equation (228) can be also interpreted as the integrability condition of the gravi- 
tational field equation (74). Taking a covariant derivative from both sides of the field 
equation (74) and applying the equations of motion (228) yields 



(pag + Fag) 



1/3 



= — 47T 






1 / 3 ' 



(229) 



In the linear approximation, when all quadratic and higher-order terms with respect to 
the perturbations are discarded (E^ ^ 0), the co variant divergence = 

0. It agrees with the assumption that the stress-energy tensor of the bare perturbation 
is conserved in the linearized perturbative order, = 0- Now we are set to start 

calculating equations of motion of matter of the bare perturbation in FLRW universe. 



6.2 Equations of Motion in the Universe Governed by Dark 
Matter and Dark Energy 

The dark matter and dark energy components of matter that governs the temporal 
evolution of the universe are modelled by scalar fields O and 4^. Hence, the tensor, 
= 0, and, consequently, the second term in the right side of (228) is identically 
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nil. Therefore, equations of motion of matter (228) can be written more explicitly in 
the following form 

+ r;/ + 1 + MqEO) u„ (230) 

where we have used definitions (163) of the gradients of the scalar fields O and 'T in 
terms of the background four- velocity u^. Next step is to calculate the explicit form 
of and as well as the covariant divergences of stress-energy tensors entering 
(230). We split the process of calculation in three parts — for gravitational field, for 
dark matter, and dark energy. 



6.2.1 Gravitational Field 



Covariant divergence from the stress-energy tensor of gravitational field, is 
derived from (207). We again need to employ the commutation relations (205), (206) 
along with a rule for the third order derivative 



I !i\pcny — I !i\ph'a pi\p^ jai' H” ^ 'j\p^'^ paiy H” I p\'-^ pav •> (231) 



which allows us (after one more commutation of the covariant derivative in /^^^ipz^cr) 
to derive 



I p\pay — ^p\po- I p\(7^ap I p\p^acr I ppa (232) 

H” I {3\p^^ paa H” I p\j3^^ paa I p^ap\(j I (3^^ ppa\(j' 



Significant number of terms cancel out, and after long calculation we obtain a rather 
simple result 



V'" = - \ - (233) 

where was defined in (208). After taking the covariant divergence, it is conve- 
nient to algebraically split (233) in three parts 




( 234 ) 
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where the first line describes the coupling of gravity with the stress-energy tensor 
of the bare perturbation, and the second and the third lines outline the contribution 
of dark matter (index ‘m’) and dark energy (index ‘q’). 



6.2.2 Dark Matter 



The source density of dark matter, depends only on the derivatives of the scalar 
field <i>, and can be written in the form of a covariant divergence 






m\iy 



with the matter current 






1 



dJ^ 



167tV^ 

The current is split in two components 

Ja = PmUai + PmPa'^jp, 

where 






J_£! 

2/im 



1 9 In Cg 

2/im 



(235) 



(236) 



(237) 



(238) 



1 






7/3 = 4” ( ^ («"</>«) 

Tim \ Ci)'- 



4>p 



(239) 






+ 



0-9 



22 



Taking covariant divergence from entangles a lot of algebraic operations which 

number can be significantly reduced by making use of the following procedure. We 
notice that the first term in the right side of (230) can be transformed to 
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Ufj^ + 3^H — H piYiPmjiy • (240) 

Furthermore, 



167ry^r“^ = 



dfipQi' 

dr^ 



( dr^ dr^ (5/in 






+ 



per ^9 pa j 

dT^ 



50^ ■ S/im 

Accounting for the variational derivatives (299), (322) we obtain 



(241) 

(242) 



1 _ _ 1 /_ _ dT^ 1 dT^ _ \ 

This equation elucidates that we do not need to directly calculate the partial deriv- 
atives with respect to the specific enthalpy when calculating the equations of 
motion (230). It saves us from doing a lot of algebra. 

We combine (243) with the divergence from the second line of (234) and denote 



^ - T . (244) 

Notice that is not symmetric with respect to its indices. Calculation reveals 




(245) 



Let us denote the density of the force caused by dark matter on the motion of the 
matter of the bare perturbation, by f^. After grouping all terms together the force 
density is defined by the following expression 
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3 






pa\fi 2 

1 






(246) 



H” 2 ^2 2 ^ PToaf^mju Pp 



which can be split in two orthogonal components 



C=a^Ui,+a^P\, 



(247) 



« = 



+ 



where = —u^f^ and = Pf/ f^. We have 

^Pm - 2 («“</>«) (u^AiS^ (248) 

(‘ “ 5) 

X + qii°‘uf^(j)al3 - {u°‘<j>a) + (M"0a) j 

+ 2pm7/ («“,/.«) (iq - 1/^ 

+ [z“^M«</>/3 + («“</>«) (3q - /) 



3 d\nc} / l \ , r> , ^ 



— 2^^ 



+ 



(249) 



Ipm ^1 - [Z/t"M^</>a/3 - qM"</>Ma + {u'^Ha) 4>IJ. + {u°'4>a) A^ 

^2q — -l^ 4>ij, 



^pmH I 



3 0 In 

H” Q PmPmH [(i/ 0Q;) 1^ Uq, — C(0^] . 

o ^Pm 



6.2.3 Dark Energy 

The dark energy source, E^, depends not only on the derivatives of the scalar field 'P 
but on the field itself through the field potential W = W(0). Therefore, according 
to definition (102) 
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= 



\6'K-y/—g 



dPi /dP\ 

’’’ Va'i'i.j, 



(250) 



After taking the variational derivatives we obtain 






5\I/3 2 d'i> 






+ 






(251) 



The force caused by dark energy on the motion of the matter of the bare perturbation is 



= _T-q w 



fq = 

j U — ‘ L 



+ 



— f 

167T \ 



— i 

Stt V 






w 



(252) 









After long but straightforward calculation we get 



= PqM/. (253) 

X ^/“^tZ'a/3 + Ualp/S^ 

+ {A°‘'tpa) “/U + ^ {A°‘Ua) 'IpiJ, + PqH ^2q - 

1 dW 1_ d^W / _ 1_\ 

~ - 2q^/’p) + ^PqV’-^^ yfj.uu" — -lu^ j . 



6.3 Final Form of the Equations of Motion 



After making use of the results of the presiding section, equations of motion (230) 
take on the following form 

~ ~ 2 ~ (254) 

+ AP%,-^-A,<I= f^ + ff 

The left side of this equation can be brought to a more conventional form if we use 
relation (93) between the stress-energy tensor of the bare perturbation and 
Let us take a covariant divergence of T^jj with respect to the full metric g^jj that 
is = g^^VyTp^ where denotes a covariant derivative with the connection 
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referred to the full metric so that = 0- The covariant derivatives from the 

stress-energy tensor are calculated with the help of 

~ ~ ( 255 ) 

where 0 ^ ^ = pf ^ — f f ^ is a tensor describing the difference between the perturbed 
and unperturbed values of the Christoffel symbols which we shall call the Christoffel 
tensor. In terms of the metric tensor perturbations 

^afi ~ 2 ^^"^ (^ 7 o|/^ ^'Jii\a ~ • (256) 

Equation (256) is exact. In the linear approximation with respect to the it can be 
expressed in terms of the background metric and perturbations as follows 

^afi — “ WI 7 ) 4 ~ , (257) 

where all quadratic terms with respect to /^^y have been neglected. Two contracted 
values of the Christoffel tensor are 

^ ( 258 ) 

Making use of these notations and definitions and doing direct calculation allows us 
to show that 



V.7)." = 1 (259) 

~2 ~ ~ 

Hydrodynamic equation of motion (254) becomes 

= /7 + f^. (260) 

Were the background spacetime fiat, the right side of (260) would vanish. However, 
in cosmology the background spacetime is given by FLRW metric, thus, making the 
divergence of the stress-energy-momentum tensor of the bare perturbation different 
from zero. The reader should not be confused here. Equation (260) is the law of 
conservation of the total stress-energy-momentum tensor given in the form, in which 
the covariant divergence of the stress-energy-momentum tensor of the bare pertur- 
bation is singled out and put to the left side. The right side represents the covariant 
divergence of the stress-energy-momentum tensor of the background matter in the 
disguised form of the sum of two vectors, and /^. 
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Appendix 1: Hilbert and Einstein Lagrangians 



We define the Christoffel symbols of the second kind as usual [46] 

k {^95(3,^ 4“ 96j,f3 9(3^,^ • (261) 

The Christoffel symbols of the first type 

~ ^octT f3j = ~ {9af3,^ 4“ 9a^,(3 ~ 9(3^,a) •> (262) 

We notice the symmetry with respect to the last two indices • There 

is no any symmetry with respect to the first two indices. In general, 



r'o;/3y — {a(3)^ 4” ^\a(3Yf-> 



(263) 



where 



1 1 . . 

k(o//3)y — ~9ap,j^ ^[0/dl7 ~ 2 ~ 9^f3^a) 



(264) 



There are two, particularly useful symbols that are obtained by contracting indices 
of the Christoffel symbols of the first kind. They are denoted as 



and 









(265) 

(266) 



Direct inspection shows that 

Va = = = (In . (267) 

The two symbols are interrelated 

kcK = ~ya 4“ 9^"^ 9oi(3,^'> 

r“ = -r - g^3p^ 



(268) 

(269) 
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We define the Riemann tensor as follows [46] 

(270) 

Second covariant derivatives of tensors do not commute due to the curvature of 
spacetime. For example, denoting the covariant derivatives with a vertical bar we 
have for a covector field !Fa and a covariant tensor of second rank, the following 
commutation relations 



(271) 

*7^ onl3\^S — *7^ a;/3|^7 H” u(3 H” Diji- {212) 



Riemann tensor can be also expressed in terms of the second partial derivatives of 
the metric tensor and the Christoffel symbols 



— 2 H" 9ua,pfi 9a(3,iiv 9fiu,a(5) ”1” ^ av ^ a(5' 



(273) 



Contraction of two indices in the Riemann tensor yields the Ricci tensor 



R 






pv,a 



- yp,. + - r 



7 ^ pu ■ 



1 p7 
z^7 A pai 



(274) 



or, in terms of the second derivatives from the metric tensor and the Christoffel 
symbols, 

l^piy — 2^ i^9pK,,ei' "b 9iyK,ep ~ 9Ke,pv ~ 9pi',Ke) ”b 9 ^ ppe^^ Kiy ~ ^ppv^^ • (275) 



One more contraction of indices in the Ricci tensor brings about the Ricci scalar 
which we shall write down in the form suggested by Fock [14, Appendix B] 



R = L + - 3 ;^,^, (276) 

where 

L = - yaV^pu) , (277) 

is (up to a constant factor) the gravitational Lagrangian introduced by Einstein [79] as 
an alternative to the gravitational Lagrangian, R, of Hilbert. The Hilbert Lagrangian 
is the Ricci scalar which depends on the second derivatives of the metric tensor while 
the Einstein Lagrangian does not. 

The two Lagrangians are interrelated 



R = L + (-gr^/^A'^,a, 



(278) 
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where 

(r“ - V") , (279) 

is a vector density of weight +1. After performing differentiation in (278), and 
accounting for (265) we can easily prove that (278) reproduces (276). 

One more form of relation between R and L will be useful for calculating the 
variational derivative in Appendix section “Variational Derivative from the Hilbert 
Lagrangian”. To this end we introduce a new notation 

r = + ( 280 ) 

and notice that 

(281) 

Equations (280), (281) allows us to cast (276) to the following form 

7? = L + r + (282) 

that was found by Fock [14, Appendix B]. 



Appendix 2: Variational Derivatives 



Variational Derivative from the Hilbert Lagrangian 



The goal of this section is to prove relation (45) being valid on the background 
manifold M. We shall omit the bar over the background geometric objects as it 
does not bring about confusion. We notice that the Hilbert Lagrangian density, = 
— (167 t)“^ differs from by a total derivative that is 

a consequence of (278). Due to relation (9) the Lagrangian derivatives from and 
coincides 



_ 6C^ 



(283) 



thus, pointing out that we can safely operate with the Einstein Lagrangian density 
C^. Because of (35), we have 






(284) 



which suggests that calculation of the variational derivative with respect to the metric 
tensor is sufficient. 

Calculation of the variational derivative 5C^ ! 5 demands the partial derivatives 
of the contravariant metric and Christoffel symbols with respect to . The partial 
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derivatives of the metric are calculated with the help of (37), (30). The Christoffel 
symbols are given in terms of the partial derivatives from covariant metric tensor, 
9a(3,-f which are not conjugated with the dynamic variable . Thus, calculation of 
the partial derivative with respect to from the Christoffel symbols demands its 
transformation to the form where the conjugated variables ^^^,7 are used instead. 
This form of the Christoffel symbols is 



Taking the partial derivative of (285) with respect to the contravariant metric yields 



where we have used (263). Contracting (286), (287) with the Christoffel symbols 
and the metric tensor results in 



Partial derivatives of the Christoffel symbols with respect to the metric derivatives 
are calculated from (285) with the help of (38). We get 



r“/37 = 1 - 5"‘^,75/3 <t) • (285) 




and 



Qgliu ~ 



dya 



(287) 




- y^y,. 



(288) 



(289) 



(290) 




(292) 



(291) 



Contracting (291), (292) with the Christoffel symbols and the metric tensor results in 
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(293) 



(294) 



(295) 



Explicit expression for the variational derivative of the Einstein Lagrangian is 



Replacing the partial derivatives in (296) with the corresponding right sides of 
Eqs.(37), (30), (293)-(295) and taking the partial derivative with respect to spa- 
tial coordinates, yields 



where we have used expressions (274), (276) for the Ricci tensor and Ricci scalar 
respectively. Substituting Eq. (297) to (284) yields 



Variational Derivatives of Dynamic Variables with Respect 
to the Metric Tensor 

Variational Derivatives of Dark Matter Variables 

The primary thermodynamic variable of dark matter is /im defined in (125). Varia- 
tional derivative from /im is calculated directly from its definition and yields 





(297) 




(299) 



Variational derivative of pressure pm is obtained from thermodynamic relation (121a) 
by making use of the chain differentiation rule along with (299), that is 
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^ P\XV 1 - - -n-i/ /OAA\ 

. (300) 

2 

Variational derivative of the rest mass and energy density are obtained by making 
use of (299) along with equation of state that allows us to express partial derivatives 
of Pm and 6m in terms of the variational derivative for pm- More specifically, 



^Pm 

Jem 



1 

2cj 

lc^_ 

let 



(301) 

(302) 



where the speed of sound appears explicitly. Variational derivatives from products 
and/or ratios of the thermodynamic quantities are calculated my applying the chain 
rule of differentiation and the above equations, 




PmPmU^u’^, 


(303) 


pm -n-l/ 
-^U^U , 


(304) 


Pm 




PmPmU^u''- 


(305) 



Variational Derivatives of Dark Energy Variables 



The primary thermodynamic variable of dark energy is pq defined in (125). Varia- 
tional derivative from pq is calculated directly from its definition. 



^9iiv 






(306) 



Variational derivative of the mass density pq of the dark energy “fluid” follows 
directly from pq = pq, and reads 



Spq 



1 

2 



pqU^U^ . 



(307) 



Variational derivative of pressure pq is obtained from definition (138) along with 
(306), which yields 



SPq 

5gnv 



-PqPqU^u''. 



(308) 
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Variational derivative of energy density eq is obtained by making use of (306) along 
with (137). More specifically, 



(309) 

2 ^ ^ 

Variational derivatives from products and ratios of other quantities are calculated my 
making use of the chain rule of differentiation and the above equations 

^ (PqMq) 

^{h\ 

\/^q/ 

^ (Pq - gq) 






0, 

0 . 



(310) 

(311) 

(312) 



Variational Derivatives of Four- Velocity of the Hubble Flow 



Variational derivatives from four- velocity of the fluid are derived from the definition 
(163) of the four- velocity given in terms of the potential O or ^ which are indepen- 
dent dynamic variables that do not depend on the metric tensor. Taking variational 
derivative from (163) and making use either (299) or (306) we obtain 



5u ci 
6u^ 



2 



— u^u^u^ 
2 



(313) 

(314) 

(315) 



where Eq. (315) accounts for the fact that <pa is an independent variable that does not 
depend on the metric tensor. 



Variational Derivatives of the Metric Tensor Perturbations 

Variational derivatives from the metric tensor perturbations are determined by 
taking into account that / -/g and is an independent dynamic variable 

which does not depend on the metric tensor. Therefore, its variational derivative is 
nil, and we have 
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Sg, 



fiiy 



\v 9 J ^dfiu \v 9 / ^ 



(316) 



Other variational derivatives are derived by making use of tensor operations of rising 
and lowering indices with the help of ga /3 and applying from (316). It gives 



^g^iu 

SI 

Sgtip 

6q 



= -\la(Sg^^+2la%\ 






(317) 

(318) 



S 

Sguv 



Sg/xi/ 



q (u'^u’^ + i + liil^u'') , (319) 

-r'' 



(320) 



Variational Derivatives with Respect to Matter Variables 



Variational Derivatives of Dark Matter Variables 



The dark matter variables do not depend on the Clebsch potential <t> directly but 
merely on its first derivative 4 >q,. Therefore, any variational derivative of dark matter 
variable, say, Q = Q(i>a), is reduced to a total divergence 



6Q _ d dQ 

5<t> ~ dx^ d^a ' 



(321) 



We present a short summary of the partial derivatives with respect to 



djJ^m 

dpm 

d^a 

dpm 

d^a 

dtm 

d (PmMm) 
d^a 



= m“. 


(322) 


— Pm^ ? 


(323) 


pm -oi 

= ~^—u , 
Ct Pm 


(324) 






= -^pmU"^, 


(325) 


/ ^2\ 






(326) 
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9 

\Mm/ 


(i - i) 


(327) 


9 (Pm ~ ^m) 

5<t>« 




(328) 


Partial derivatives of four velocity 






dua Pa^ 

d^P Pm 


d^a pa0 

d^P Pm 


(329) 


It allows US to deduce, for example. 






d(u^(l)a) _ 


Pm 


(330) 




dq 

9^a 


Pm 


(331) 



Variational Derivatives of Dark Energy Variables 



The dark energy variables depend on both the scalar potential ^ and its first derivative 
'Tq, in the most generic situation. This is because there is a potential of the scalar 
field 1T(0) that is absent in case of the dark matter. Therefore, variational derivative 
of the dark energy variable, say, H = is 



5A _dA d dA 
~ dx^ d^a ’ 



(332) 



Partial derivatives dA/ S'T = (dA/dW) (dW / S'T, and their particular form depends 
on the shape of the potential W. As for the partial derivatives with respect to the 
derivatives of the field, they can be calculated explicitly for each variable, and we 
present a short summary of these partial derivatives below. More specifically. 



dPq 

dpq 

deg 
d^a 
9 {pqPq) 









= u , 



= PqU"^, 
= 2pqfi^, 



(333) 

(334) 

(335) 

(336) 



( 337 ) 
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^ (Pq - gq) 

d'^a 



Partial derivatives of four velocity 

dua _ pj du°‘ _ 

Aq ’ Aq 



It allows us to deduce, for example, 



dju^-lpa) 

5't'a 



Mq 



f^q 



( 338 ) 

( 339 ) 

( 340 ) 

( 341 ) 

( 342 ) 
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Abstract The super-massive 4 million solar mass black hole Sagittarius A* (SgrA*) 
shows flare emission from the millimeter to the X-ray domain. A detailed analysis of 
the infrared light curves allows us to address the accretion phenomenon in a statistical 
way. The analysis shows that the near-infrared flare amplitudes are dominated by a 
single state power law, with the low states in SgrA* limited by confusion through 
the unresolved stellar background. There are several dusty objects in the immediate 
vicinity of SgrA*. The source G2/DSO is one of them. Its nature is unclear. It may be 
comparable to similar stellar dusty sources in the region or may consist predominantly 
of gas and dust. In this case a particularly enhanced accretion activity onto SgrA* 
may be expected in the near future. Here the interpretation of recent data and ongoing 
observations are discussed. 



Sagittarius (SgrA*) at the center of our Galaxy is a highly variable radio, near- 
infrared (NIR), and X-ray source which is associated with a 4 x lO^Mo super- 
massive central black hole (SMBH). SgrA* is the closest SMBH and can be taken as 
a paradigm for quiescent or very Low Luminosity Active Galactic Nuclei (LLAGN). 
It has been shown that the strong polarized (up to several 10 %) infrared flux density 
excursions (often referred to as flares) from SgrA* show patterns of strong gravity 
as expected from in-spiraling material very close to the black hole’s horizon [1-3]. 
As a consequence of the strong gravitational held of the black hole these patterns 
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express themselves in characteristic variations of the light curve due to the effect of 
relativistic boosting, light bending and the rotation of the polarization angle 11-5]. 
Therefore, it is mainly the polarization and the strong flux variability that give us 
certainty that we study the immediate vicinity of a SMBH 11-3, 6]. 

A fast moving infrared excess source G2, which is widely interpreted as a core-less 
gas and dust cloud 17], approaches SgrA* on a presumably elliptical orbit covering 
the region of the high velocity S-stars close to SgrA*. The passage of this Dusty S- 
cluster Object (DSO) is expected to result in accretion phenomena which will give an 
improved insight into the nature of the immediate surroundings of the SMBH SgrA*. 
The year 2013 certainly was only the first in an intense observing campaign during 
which the immediate vicinity of SgrA* is being monitored while the dusty object is 
close. Recent orbit determinations expect the peri-apse passage to occur in April/May 
2014 18]. Based on recent conference contributions (see Acknowledgments) on the 
Galactic Center this article concentrates on three major topics: (1) SgrA* and its 
environment, (2) its relation to extragalactic nuclei, and (3) new instrumentation and 
perspectives. 



1 The Galactic Center 



The DSO source approaching the super-massive black hole SgrA* at the center of 
the Milky Way has spawned great activities in observing that region covering the 
entire electromagnetic spectrum from radio, via infrared to X-ray wavelengths using 
telescopes across the world. The upcoming events underline the importance of the 
Galactic Center as a laboratory to investigate and understand phenomena in the 
immediate environment of super massive black holes 19, 10]. Gas and stars within 
the Galactic Center stellar cluster provide the fuel for the central super-massive black 
hole and hence the reason for most of the flux density variability observed from it. 



1.1 The Variability of Sagittarius 

Progress has been made in the understanding of the emission process associated 
with the immediate surroundings of the super-massive black hole counterpart SgrA* 
as well as the three-dimensional dynamics and the population of the central stellar 
cluster. There is also ample evidence of interactions between the cluster and SgrA*. 

Series of monitoring observations in the near-infrared (NIR), X-ray, and sub- 
millimeter (sub-mm) regimes accumulated over the years allowed us to perform for 
the first time detailed statistical studies of the variability of SgrA* 111-13]. The 
analyses show that the histogram of the near-infrared flux density is a pure power- 
law and the emission process is most likely dominated by synchrotron radiation. 
In 111], we present a comprehensive data description for NIR measurements of 
SgrA*. We characterized the statistical properties of the variability of SgrA* in the 
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near-infrared, which we found to be consistent with a single-state process forming 
a flux density power-law distribution. We discovered a linear rms-flux relation for 
the flux density range up to 12 mJy on a time scale of 24 min. This and the structure 
function of the flux density variations imply a phenomenological, formally nonlinear 
statistical behavior that can be modeled. In this way, we can simulate the observed 
variability and extrapolate its behavior to higher flux levels and longer time scales. 
SgrA* is also strongly variable in the X-ray domain [12-17] and references there in, 
as well as [18, 19] for a recent strong flares observed with Chandra and NuSTAR. 
The detailed statistical investigation in [1 1] also suggests that the past strong X-ray 
variations that give rise to the observed X-ray echos can in fact be explained by 
the NIR variability histogram under the assumption of a Synchrotron Self Compton 
(SSC) process. SgrA* is extremely faint in the X-ray bands, though strong activity 
has been revealed through the detection of flares. Therefore, SgrA* is the ideal target 
to investigate the mass accretion and the ejection physics in the case of an extremely 
low accretion rate onto a super-massive black hole. This is actually the phase in which 
super-massive black holes are thought to spend most of their lifetime. The activity 
phase onset of a magnetar (see section below) at a separation of only about 3 arcsec 
from the Galactic Center presented a problem for the SgrA* monitoring program in 
2013 [20-22]. 

Simultaneous observations and modeling of the millimeter (mm), NIR, and X-ray 
flare emission of SgrA* are presented in [12, 13]. These data allowed us to investigate 
physical processes giving rise to the variable emission of SgrA* from the radio to the 
X-ray domain. In the radio cm-regime SgrA* is hardy linearly polarized but shows 
a fractional circular polarization of around 0.4 % [23, 24]. The circular polarization 
decreases towards the mm-domain [25], where as the authors of [26] report a variable 
linear polarization from SgrA* of a few percent in the mm- wavelength domain. The 
observations reveal flaring activity in all wavelength bands. The polarization degree 
and angle in the sub-mm are likely linked to the magnetic field structure or the general 
orientation of the source. In general — the NIR emission is leading the sub-mm with a 
delay of about one to two hours (see below) and the excursions in the NIR and X-ray 
emission are rather simultaneous. As a result we found that the observations can be 
modeled as the signal from an adiabatically expanding source component [27-29] of 
relativistic electrons emitting via the synchrotron/SSC process. A large fraction of 
the lower energy mm/cm- flux density excursions is not necessarily correlated with 
the NIR/X-ray variability (see, e.g., [30, 31] and details and further references in [12, 
13]). One may compute the SSC spectrum produced by up-scattering of a power-law 
distribution of sub-mm- wavelength photons into the NIR and X-ray domain by using 
the formalism given by [32, 33]. Such a single SSC component model may be too 
simplistic, although it is considered as a possibility in most of the recent modeling 
approaches. It does not take into account possible deviations from the overall spectral 
index of a = 1.3 at any specific wavelength domain like the NIR or X-ray regime. 

The number density distribution of the relativistic electrons responsible for the 
synchrotron spectrum can be described by 



-2a+l 



N(E) = NqE 



( 1 ) 



762 



A. Eckart et al. 



direction 
of jet or 
outflow 



increasing electron density 
towards darker shading 
10 R 

G 





blob 



Orbit 



blob 



bulk 



black hole 





t ^bulk 


blob 




blob trajectory 
within jet 
or outflow 


A black hole 




— withlSCO 



Fig. 1 Modeling approaches and the involved relativistic motions. We schematically compare 
the approaches through disk {top right) and jet {bottom right) modeling to relativistic magneto 
hydrodynamic modeling {left) 



with 7^ between 71 and 72 which limit the lower and upper bound of the relativistic 
electron spectrum 

71 < E = 7^mc^ < 72mc^. ( 2 ) 

Lorentz factors 7^ for the emitting electrons of the order of a few thousand are 
required to produce a sufficient SSC flux in the observed X-ray domain. In addition the 
relativistic bulk motion of the orbiting or outward traveling component is described 
by the bulk Lorentz factor F (see Fig. 1 ). On the left of this figure we show a sketch of 
a typical relativistic electron density distribution resulting from MHD calculations 
[ 30 , 34 , 35 ]. We show a cut through only one side of the three dimensional structure. 
The central plane and outflow region resulting from these calculations can be modeled 
in different, dedicated approaches (top and bottom right). 

In the central plane modeling the flux variations are assumed to be the result of 
the motion of an orbiting blob or a hot spot (see for example [ 4 ]). In the outflow 
model the flux variations are assumed to be due to the ejection of a blob and its 
motion along the jet (with bulk motions close to the speed of light) or a much slower 
overall outflow component. In this case — for VLB I observations — the larger outflow 
extent at increasingly lower radio frequencies would be hidden by the decreasingly 
lower angular resolutions due to interstellar scattering. In Fig. 2 the accretion disk 
(here assumed to be edge-on) is shown as a vertical thick line to the right, the dashed 
part indicates the disk sections in the back- and foreground. Extending to the left 
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Fig. 2 Sketch of a possible source structure for the accretion disk around the SMBH associated 
with SgrA* following Fig. 12 in [36] 

we show one side above the disk. Here higher energy flare emission (lower part) is 
assumed to be responsible for the observed NIR/X-ray flare emission. Lower energy 
flare emission (upper part) may substantially contribute to long wavelength infrared 
emission. In addition to the expansion towards and beyond the the mm-source size, 
radial and azimuthal expansion within the disk may occur. Hence, long mm/cm- 
wavelength variability may originate from different source components of SgrA* 
and may be difficult to be disentangled based on radio data alone. 

In Fig. 1 the relativistic boosting vectors for the electrons 7 ^ and the bulk motion 
Tbuik are not drawn to a proper relative scale. One can assume 7 ^ > Tbuik^ In the case 
of relativistically orbiting gas as well as relativistic outflows one may use modest 
values for F. Both dynamical phenomena are likely to be relevant in the case of 
SgrA*. The size of the central plane synchrotron component is assumed to be of the 
order of or at most a few times the Schwarzschild radius. From the overall variable 
radio/sub-mm spectrum spectrum we can assume a turnover frequency of a few 
100 GHz (see details in [12]). The motion of the synchrotron emitting cloud can be 
described via 




( 3 ) 

( 4 ) 



Tbuik = 
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Here (3 = vie and v is the speed of the bulk motion of the synchrotron cloud, 5 is the 
Doppler factor and the angle to the line of sight. For a relativistic bulk motion with 
Tbuik around 1 .7 zb 0.3 (i.e. angles 0 of about 30° zb 15°) the corresponding magnetic 
field strengths are often assumed to be of the order of a few ten Gauss, which is also 
within the range of magnetic fields expected for RIAF models [37, 38]. 

Modeling of the light curves shows that (at least for the brighter events) typically 
the sub-mm flux density excursions follow the NIR emission with a delay of about 
one to two hours with an expansion velocity of about 0.01-0.001 c [27-29]. We find 
source component sizes of around one Schwarzschild radius, flux densities of a few 
Janskys, and steep spectral indices. Typical model parameters suggest that either 
the adiabatically expanding source components have a bulk motion larger than its 
expansion velocity or the expanding material contributes to a corona or disk, confined 
to the immediate surroundings of SgrA*. For the bulk of the synchrotron and SSC 
models, we find synchrotron turnover frequencies in the range of 300-400 GHz. 
For the pure synchrotron models, this results in densities of relativistic particles in 
the mid-plane of the assumed accretion flow of the order of 10^-^ cm“^, and for the 
SSC models the median densities are about one order of magnitude higher. However, 
to obtain a realistic description of the frequency-dependent variability amplitude 
of SgrA*, models with higher turnover frequencies and even higher densities are 
required. This modeling approach also successfully reproduces the degree of flux 
density variability across the radio to far-infrared spectrum of SgrA*. In Fig. 3 we 
show observed flux densities of SgrA* taken from the literature (blue) compared to a 
combined model that consists of the fit given in [39, 40] (black line), and [34] (black 
dashed line). We plotted in red the spectra of synchrotron self-absorption frequencies 
for the range of models. Here we show results for the preferred synchrotron plus SSC 
(SYN-SSC) model that most closely represents the observed variability of SgrA*. 

In [41] theoretical polarimetric light curves expected in the case of optically thin 
NIR emission from over-dense regions close to the marginal stable orbit are presented 
(see also [1-5]). Using a numerical code the authors track the time evolution of 
detectable polarization properties produced by synchrotron emission of compact 
sources in the vicinity of the black hole. They show that the different setups lead to 
very special patterns in the time-profiles of polarized flux and the orientation of the 
polarization vector. As such, they may be used for determining the geometry of the 
accretion flow around SgrA* (see also [2, 3, 42]). 

During the 201 3 Bad Honnef and the Granada conference (see Acknowledgments) 
efforts to monitor SgrA* during the DSO fly-by and first observational results from 
2013 were reported in [8, 43-49]. The NRAO Karl G. Jansky Very Large Array 
(VLA) is undertaking an ongoing community service observing program to follow the 
expected encounter of the DSO cloud with the black hole SgrA* in 2013/2014 [50]. 
The NRAO VLA has been observing the Sgr A region since October 2012 on roughly 
a bi-monthly interval, cycling through eight observing bands. For monitoring the flux 
densities and in particular the radio spectral indices the short wavelength observations 
(A < 6 cm) are most useful. For 2012/2013 no particular flux density variation was 
detected that could be attributed to the interaction between SgrA* and the DSO. 
This may be linked with the fact that the newly determined periapse passage is now 
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Fig. 3 The variable radio 
spectrum of SgrA*: 
measurements and model 
results (see text and [12, 13] 
for details) 




frequency in GHz 



expected to happen in April/May of 2014 [8], i.e. later than originally anticipated. 
However, in the radio- shock frame in which variations of up to several Jansky s were 
expected even during the pre-periapse time. Hence, the lack of strong radio flares 
indicates that the medium is less dense than expected and/or that the bow-shock size 
i.e. the cross-section of the dust source is much smaller than assumed [51-56]. 



1.2 VLBI Imaging of SgrA"^ 

There is also profound progress in imaging and modeling of the central putative 
accretion disk of SgrA* as well as the jet that may be associated with the source [35, 
41, 57]. In fact imaging of SgrA* may turn out to be a Rosetta Stone in the attempts 
of distinguishing between different relativity theories of black holes (see [58], e.g., 
2013, on astronomical tests of general relativity and the pseudo-complex theory). 

VLB I (Very Long Baseline Interferometry) observations at very short millimeter 
radio wavelengths can overcome the effects of interstellar scattering and allow us to 
study the source intrinsic structure of SgrA*. Large mm/sub-mm facilities like the 
VLB A (Very Long Baseline Array), VERA (VLB I Exploration of Radio Astrometry), 
ALMA (Atacama Large Millimeter Array), PdBI (Plateau de Bure Interferometer) 
and the sensitive mm-telescopes in the EVN (European VLBI Network) — such as 
the IRAM 30 m and the 100 m Effelsberg telescopes — are participating in this effort, 
which will eventually culminate in the project EHT (Event Horizon Telescope), a 
VLBI array especially designed to image the structures close to the event horizons 
of the larges SMBHs in the sky — namely SgrA* and M87, with 1 Schwarzschild 
radius extending to an angular size of about 10 and 3.7 jjtas, respectively. Multi- 
epoch imaging observations will allow to constrain the locations and sizes of the 
flaring region of SgrA* within the putative temporal accretion disk of the accre- 
tion stream/flow towards or an accretion wind from SgrA*. These measurements 
will also constrain the acceleration processes (e.g. magnetic reconnection events or 
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non-axisymmetric standing shocks) that give rise to the population of relativistic 
electrons and the variable emission we see from SgrA*. Ultimately, alternative black 
hole models will be probed and attempts to test the black hole no-hair theorem will 
be possible with the new VLB mm/sub-mm facilities [43, 44, 59-65]. 

These VLBI experiments will eventually enable spatially resolved studies on sub- 
horizon scales, leading to an unprecedented exploration of a putative predicted black- 
hole shadow [66] as an evidence for light trapping by the black hole as well as its 
interaction with the surrounding material. It will be possible to monitor the possible 
expansion of source components during flare activity. Furthermore, when a rotating 
black hole is immersed in a magnetic field of external origin, the gravito-magnetic 
interaction is capable of triggering the magnetic reconnection, accelerating the par- 
ticles to very high energy [67-69]. This frame-dragging phenomenon is particularly 
interesting in the context of exploring the strong-gravity effects in astrophysical black 
holes because the effect does not have a Newtonian counterpart and it operates on 
the border of the ergospheric region [70], i.e. very close to the black hole horizon, 
and it can be probed with the future EHT. Also, one can investigate if the black 
hole proximity generates conditions favorable to incite the magnetic reconnection 
that eventually leads to plasma heating and particle acceleration. This effect could 
contribute to the flaring activity. 

1.3 The Importance of Dusty Sources Close to the Center 

A major discussion point is if and how the DSO source will be disrupted during its 
peri-bothron^ passage. It may be only its dusty envelope that will be disrupted since 
the K^-band identifications of the source suggest that it can also be associated with 
a star [45]. In addition to the VLT NACO and the Keck NIRC detections of the DSO 
NIR continuum emission [46], here we show the detection of the DSO continuum at 
about K ~ 19 using SINFONI data (Fig. 4). The detection of the continuum emission 
in data sets taken with three different instrumental setups over many years strengthens 
the case for a substantial continuum emission from that dusty source. As posted in 
the astronomer’s telegram No. 6110 on 2 May 2014 [72], the DSO was detected 
3.8 |JLm during its peri-bothron passage around the central black hole SgrA*. Hence, 
it appears to be intact and up to this point not yet heavily affected by tidal effects. 
This clearly supports our finding [46] that it may very well be a dusty star rather than 
a pure gas and dust cloud. 

In contrast to a pure dust and gas nature of the DSO its possible stellar (i.e. a 
dust enshrouded star) nature is discussed and partially favoured in [8, 45^7, 49, 
73-75] . Eckart et al. [45] investigate the possible mass transfer across Lagrange point 



^Peri- or apo-bothron is the term used for peri- or apoapsis — i.e. closest or furthest separation — for 
an elliptical orbit with a black hole present at one of the foci. As already mentioned by Frank and 
Rees in [71] the word ‘bothros’ was apparently first suggested in the context of black holes by W.R. 
Stoeger. It originates from the greek word 6 [366 with the equivalent meaning of ‘the sink’ or 
‘the deep dark pit’ . 



The Galactic Center Black Hole Laboratory 



767 




Fig. 4 The DSO detected in its K-band continuum emission in 2010 SINFONI data. Left The 
original image (positive greyscale); Right A LUCY deconvolved image (negative greyscale) shown 
at an angular resolution close to the diffraction limit of the VLT UT4 



LI in a simple Roche model. If the star has a mass of about IMq, the separation 
of LI from it will be about 0.1 AU. For a Herbig Ae/Be stars with 2-8 Mq that 
distance will be between 0.2 and 0.5 AU. For a typical S-cluster stellar mass of ^^20- 
30 Mq the separation will be closer to one AU. The interferometrically determined 
inner ring sizes that one typically finds for young Herbig Ae/Be and T Tauri stars 
can indeed be as small as 0.1-1 AU [76]. Any stellar disk or shell may already have 
been stripped substantially if the DSO has performed more than a single orbit. If 
the source has a size of about 1 AU (as determined from its MIR-luminosity; [7]) 
then a significant amount of the dusty circumstellar material may pass beyond LI 
during peri-bothron passage. This material will then start to move into the Roche 
lobe associated with SgrA*. 

However, it is not at all clear what will happen to the transferred material after 
the peri-bothron passage around May 2014 or beyond. The fact that this dusty object 
may be a dust enshrouded star rather than a dust cloud will have an influence on 
the expected flux density variations resulting from the close approach. They may be 
much weaker than expected. Simulations [77-79] that have discussed the feeding 
rate of SgrA* as a function of radius indicate that a portion of the material may 
fall towards SgrA*. If SgrA* is associated with a significant wind on scale of the 
peri-bothron separation, then a large part of the material may be blown away again 
by an out-bound accretion wind. Shcherbakov and Baganoff [80] have discussed the 
feeding rate of SgrA* as a function of radius. Based on their modeling one may 
suggest that the bow-shock sources X3 and X7 [81] are still in the regime in which 
most of the in-flowing mass is blown away again. Another case for comparison is 
the star S2. During its peri-bothron passage the star has been well within the zone in 
which matter of its (weak) stellar wind could have been accreted by SgrA*. The DSO 
peri-bothron will be at a larger radius than that of S2 [8]. This may imply that no 
enhanced accretion effect will result from it during the peri-bothron passage. Until 
May 2014 no increase in variability and no significant flux density increase well 
above normal levels has been reported in the radio to X-ray domains. 
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The fate of the DSO and the cometary sources X3 and X7 underline the importance 
of investigating the wind properties in the vicinity of SgrA* in more detail. IRS 8 
is a unique possibility to study the bow shock properties and polarization features 
in the dusty environment at the Galactic Center. Based on a detailed study of near- 
infrared emission Rauch et al. [82] present interstellar dust properties for the northern 
arm in the vicinity of the IRS 8 bow shock. This study allowed us for the first 
time to determine the relative positioning of IRS 8 with respect to the northern arm 
and the super-massive black hole SgrA*. The result indicates that the central star 
of IRS 8 is in fact located closer towards the observer than the northern arm. In 
[45] we investigated the near-infrared proper motions and spectra of infrared excess 
sources at the Galactic Center. The work concentrated on a small but dense cluster 
of comoving sources (IRS13N) located ?>" west of SgrA*. Our analysis shows that 
these stars are spectroscopically and dynamically young and can indeed be identified 
with continuum emission at 2 |xm and shortward, indicating that these mid-infrared 
sources are not only dust sources but young stars. The possibility of ongoing star 
formation at the Galactic Center is supported through simulations [48]. In fact the 
DSO may be a representative of dusty sources similar to those discussed in [29, 
Fig. 14] and compare also to the discussion of sources X3 and X7 in [81]. Meyer 
et al. [73] present NIR spectroscopic data of several of these sources. They also show 
that the DOS does not seem to be unique, since several red emission-line objects can 
be found in the central arcsecond. In summary, the authors of [73] conclude that it 
seems more likely that G2 is ultimately a stellar source that is clearly associated with 
gas and dust (see also [45-47]). 



1.4 Stellar Dynamics and Tests of Relativity 

SgrA*, the super-massive black hole at the center of the Milky Way, is surrounded 
by a small cluster of high velocity stars, known as the S-stars [83]. Sabha et al. 
[84] aimed at constraining the amount and nature of the stellar and dark mass that 
is associated with the cluster in the immediate vicinity of SgrA*. The authors use 
near-infrared imaging to determine the Ks-band luminosity function of the S-star 
cluster members, the distribution of the diffuse background emission and the stellar 
number density counts around the central black hole. This allows us to determine the 
stellar light and mass contribution expected from the faint members of the cluster. In 
[84] post-Newtonian N-body simulations are used to investigate the effect of stellar 
perturbations on the motion of S2, as a means of detecting the number and masses 
of the perturbers. The authors find that the stellar mass derived from the Ks-band 
luminosity extrapolation is much smaller than the amount of mass that might be 
present considering the uncertainties in the orbital motion of the star S2. Also the 
amount of light from the fainter S-cluster members is below the amount of the residual 
light at the position of the S-star cluster after one removes the bright cluster members. 
If the distribution of stars and stellar remnants is peaked near SgrA* strongly enough, 
observed changes in the orbital elements of S2 can be used to constrain both the 
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masses and the number of objects inside its orbit. Based on simulations of the cluster 
of high velocity stars we find that in the NIR K-band — close to the confusion level 
for 8 m class telescopes — blended stars will occur preferentially near the position 
of SgrA* which is the direction towards which we find the highest stellar density. 
These blended stars consist of several faint, (with the current facilities) individually 
undetectable stars that get aligned along the line-of-sight, producing the visual effect 
of a new point source. The proper motion of stars and the corresponding velocity 
dispersion leads to the fact that such a blended star configuration dissolves typically 
after 3 years. 

Stars that get very close to the super massive black hole are ideal probes to analyze 
the gravitational field and to search for effects of relativity due to the presence of the 
high mass concentration and its effect on space time. This can be done by tracing 
the orbit of stars through proper motions and radial velocities. As discussed in [85] 
relativistic effects should express themselves spectroscopically. The redshift z of a 
black hole orbiting star can be written as: 

Z = AA/A = Bo + Bif5 + B20^ + 0(/3^) (5) 

with Bo being an offset, B\(3 describing the Doppler velocity and B20^ expressing 
the relativistic effects. Here the value B 2 contains equal contributions from the grav- 
itational redshift and the special relativistic transverse Doppler effect. The combined 
effect gives a redshift that is about an order of magnitude larger than the currently 
achieved spectral resolution of (5A/A ~ 10“"^. For S2 one expects about a 150- 
200km/s signal measurable over a few months on top of an orbit-depending radial 
velocity of more than 4000 km/s. Expectations are high that this will be observable 
during the next peri-bothron for S2 around 2017.9 zb 0.35 [86, 87] or S2-102 around 
2021.0 zb 0.3 [49]. Realistically, however, one needs several stars on different orbits 
to detect the relativistic effect with certainty ([85]; see also [88] for peri-bothron 
shift). Alternatively, one has to find stars that are (or get) closer than S2 and S2-102 
([49]; see below) to SgrA*. 

Detailed imaging and the analysis of proper motions may be another way to 
trace relativistic effects. An important deviation from Keplerian motion occurs as 
a result of relativistic corrections to the equations of motion, which to the lowest 
order predict a certain advance of the argument of peri-bothron each orbital period. 
Choosing a = 5.0 mpc and ^ = 0.88 for the semi-major axis and eccentricity of S2, 
respectively, and assuming a black hole mass of M, = 4.0 x 10^ Mq this advance 
will be 

67tGM, , 

(Ao;)gr = ^ « 10.8'. (6) 

c^a{\ — e^) 

The relativistic precession is prograde, and leaves the orientation of the orbital plane 
unchanged. 

The location of the peri-bothron advances for each orbital period due to the 
spherically- symmetric component of the distributed mass that is resolved by the 
elliptical orbit of the star. The amplitude of this Newtonian “mass precession” is 
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(Acj)m = -27tGm(^, 7 )\/ 1 - 



(7) 



Here, = M^(r < a) is the distributed mass within a radius r = a, and Gm is a 
dimensionless factor of order unity that depends on e and on the power-law index of 
the density, p oc r~'^ [89]. In the special case 7 = 2, 



Mass precession is retrograde, i.e., opposite in sense to the relativistic precession. 

The contribution of relativity to the peri-bothron advance is determined uniquely 
by the known values of a and e. A measured Auj can then be used to constrain the 
mass enclosed within S2’s orbit, by subtracting (Acj)gr and comparing the result 
with Eq. (9). So far, this technique has yielded only upper limits on of ~ 

[90]. First robust upper limits of that order have been derived in [91, 92]. The role 
of relativistic effects versus the gravitational effects of the nuclear star cluster and a 
ring of stars are summarized in [93]. 

In addition to the relativistic effects on the orbit of a test star, and the precession 
due to the smooth matter distribution, the granularity of the cluster stars and stellar 
remnants needs to be considered. The granularity of the distributed mass makes 
itself felt via the phenomenon of resonant relaxation (RR; [94, 95]). On current 
observational time scales the stellar orbits near SgrA* remain nearly fixed in their 
orientations. The perturbing effect of each field star on the motion of a test star (e.g. 
S2) can be approximated as a torque that is fixed in time and proportional to the 
mass m of the field star. Sabha et al. [84] show that the effects of RR are competing 
with the relativistic and Newtonian periastron effects on the orbits of the known high 
velocity S-cluster stars. Following [84], we show in Fig. 6 the predicted change in 
S2’s orbital elements over the course of one orbital period ("^15.6 year). The shift due 
to relativity, Acjgr ~ 1 has been subtracted from the total; what remains is due to 
Newtonian perturbations from the field stars. Each histogram was constructed from 
integrations of 100 random realizations of the same initial model, with field- star mass 
m = IOMq, and four different values of the total number: N = 200 (solid/black); 
N = 100 (dotted/red); N = 50 (dashed/blue); and N = 25 (dot-dashed/green). The 
average value of the peri-bothron shift increases with increasing Nm, as predicted by 
Eq. (10). There is also a separate contribution that scales approximately as l/\fN 
and that results in a variance about the mean value. 

Sabha et al. [84] show that the net effect of the torques from N field stars is to 
change the angular momentum, L, of S2’s orbit according to 




( 8 ) 



so that 




(9) 




( 10 ) 
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where is the angular momentum of a circular orbit having the same semi-major 
axis as that of the test star. Here Eq. 10 describes “coherent resonant relaxation”. 
Sabha et al. [84] state that the normalizing factor K should be of order unity [96]. 

Changes in L imply changes in both the eccentricity of the test star orbit, as well 
as changes in its orbital plane. Changes in the orbital plane can be described in a 
coordinate-independent way via the angle A6>, where 

Li • L2 

cos (A6>) = (11) 

L\L2 

and {Li , L 2 } are the values of L at two times separated by Ar [84]. For small values 
of A6> this can be written as 



A6> 




Li L2 

2— ^ 

L1L2 



Li L2 

L1L2 ■ 



( 12 ) 



If we set Ar equal to the orbital period of the test star, the changes in its orbital 
elements due to resonant relaxation are expected to be 

IAcIrr « (13) 

(A0)rr « InKtVN^, (14) 

where N is the number of stars having ( 2 -values similar to, or less than, that of the 
test star [84] . The constants Ke and Kt may depend on the properties of the orbits of 
the field star distribution in the Galactic Center stellar cluster. More details are given 
in [84]. The Kozai mechanism is another kind of resonant process that is thought to 
operate in the Galactic Center environment [97-99]. 

Figures (following [100]) plots the location of the S stars in the (a, ^) -plane 
(semimajor axis — ellipticity). The data for the two currently closest known stars 
S2 and S2-102 are indicated by pattern filled circles. For the two currently closest 
sources one finds: 



SO - 102 : Torbit = 11.5 zb 0.3 yr ^ = 0.680 ± 0.020 a = 0.100'^ zb 0.010^' 
S2 : Torbit = 15.6 zb 0.4 yx e = 0.883 zb 0.003 a = 0.125'' zb 0.002" 



([49, 86, 87]; 1” at the distance of the Galactic Center corresponds to alinear scale 
of about 39 mpc = 0.127 lyr = 8044 AU). 

Also plotted in Fig. 5 are curves indicating where the effects of RR begin to be 
mediated by relativistic precession of the “test” star. The upper curve is the “Schwarz- 
schild barrier” [101]; stars below this curve precess due to GR so rapidly that their 
precession, rather than the mean precession rate of the field stars, determines the 
coherence time over which the RR torques can act. This is reflected in the horizontal 
tick marks which give the expected amplitude of eccentricity changes as an orbit 
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Fig. 5 An example for the 
location of the Galactic 
Center S- stars on the 
(a, e) -plane. See details in 
the text and [100], in 
particular the left panel of 
their Fig. 1 




1000 mas 



100 mas 



10 mas 



precesses in the essentially fixed torquing field due to the field stars. The location 
of the Schwarzschild barrier in the (a, -plane depends somewhat on the assumed 
spatial distribution of the stars in the nuclear cluster, but for most reasonable distribu- 
tions, S2 lies below the barrier [100]. Closer to SgrA*, frame-dragging torques due to 
the spin of SgrA* begin to make themselves felt. The two lower curves on Fig. 6 mark 
where these torques begin to compete with RR torques. In this regime, orbits process 
so rapidly that their eccentricities are essentially unaffected by the \/A torques, but 
their orbital planes can still change (“vector RR”). Below the curves marked “Kerr” 
in Fig. 5 are curves indicating where the effects of RR begin to be changes in orbital 
planes due to frame dragging dominate the changes due to vector RR [102]. Stars in 
this regime can be used to test theories of gravity, e.g., “no-hair” theorems. 

Figures also shows the (a, -regions that will be experimentally accessible with 
the new upcoming interferometric instrumentation in the NIR like GRAVITY at the 
VLTI and LINC-NIRVANA at the LBT (see below). Only for stars at much smaller 
orbital separations from SgrA* than S2 or S2-102, as they may be found with infrared 
interferometers like GRAVITY at the VLTI or LINC-NIRVANA at the LBT (see 
below), are Newtonian perturbations (mass precession, RR) negligible compared 
with the effects of GR. However, even then one will need more than one star with 
different orbital elements in order to clearly demonstrate that the eventually observed 
orbital changes are truly due to relativistic effects (see, e.g., [85, 88, 102, 103]). 
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Fig. 6 Histograms of the 
predicted change in S2’s 
argument of peri-bothron, uo, 
over the course of one orbital 
period (~15.6 year; see also 
[84]). See text for a detailed 
description of the figure 




Acj — Acjgp (arcmin) 



1.5 Effects Due to Stellar Collisions 

The stellar density close to center is in excess of 10^ Mq per cubic-parsec. This 
implies that in this region the evolution of stars is influenced by stellar collisions 
(e.g. [104, 105]). Observations of the stars at the Galactic Center show that there 
is a lack of red giants within about 0.5 pc of the super-massive black hole. The 
very high stellar number densities this close to the SMBH imply that the giants — or 
their progenitors — may have been destroyed by stellar collisions. The derivation of 
collisional rates between different types of stars at the Galactic Center and their likely 
effects are currently a held of intense investigation in order to quantify how large 
a contribution stellar collisions could make to the puzzle of the missing red giants 
[106]. As an example see recent contributions on star formation (see also references 
below), and on the structure of the central stellar cluster and stellar interactions in 
the Galactic Center area by [84, 107-1 12]. 



1.6 Pulsars at the Galactic Center 

Another very promising way of investigating the super-massive black hole properties 
in great detail is to And pulsars orbiting SgrA*. If close enough, they would allow 
us to measure the spin and quadrupole moment of the central black hole with superb 
precision enabling us to test different theories of gravity [113]. The number of pul- 
sars in the central cluster will depend on the star formation history in the overall 
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region. The discovery of radio pulsations from a magnetar PSR J1 745-2900 with the 
Effelsberg telescope has highlighted the great value and the efforts that are currently 
being undertaken to find pulsars in the Galactic Center region [20, 1 14-1 17]. 



1.7 Star Formation and the Galactic Center 

Star formation activity in the Galactic Center is an ongoing research topic (e.g. 
[54, 55, 118, 119]). Given the deep gravitational potential towards the very center, 
stars cannot form in the same way they do throughout the Milky Way. However, 
infrared observations have shown evidence that massive stars were formed in the 
hostile environment of SgrA* a few million years ago. VLA, ALMA and CARMA 
measurements suggest that star formation is still taking place in this region and 
has been going on during the past few 10^ years. A broad variety of possible star 
formation scenarios can be discussed. Massive stars may have formed within a disk 
of molecular gas resulting from a passage of a giant molecular cloud interacting 
with SgrA* and then dispersed after having formed these stars [120]. Dense clumps 
originating from the CND loosing angular momentum and falling towards the very 
center may also be a source of constant or episodic star formation in the Galactic 
Center [48]. A fraction of the material associated with these phenomena may have 
accreted onto SgrA* probably in an episodic or transient manner [121]. This process 
may in part be responsible for the origin of the 7 -ray emitting Fermi bubbles [122]. 



1.8 The Galactic Center on Larger Scales 

Radio polarization observations by the Parkes radio telescope in Australia have 
recently led to the discovery of giant radio lobes emanating from the Galactic nucleus 
[123, 124]. These lobes are largely coincident with the Fermi Bubbles discovered 
in the 7 -ray domain [122]. However, the radio outflows extend to even larger angu- 
lar scales, covering about 55-60° north and south of the Galactic plane. It is likely 
that the radio lobes — and the Fermi Bubbles — are the result of the concentrated star 
formation occurring in the Central Molecular Zone of the Milky Way rather than 
signatures of putative activity of the super-massive black hole. These Fermi bubbles 
may be related to the giant magnetized outflows from the Galactic Center [125-127] . 
These phenomena link the Galactic Center nuclei of nearby active galaxies and larger 
scale events (e.g. amounts of gas escaping from the potential well of the galaxy). 
Massive inflow of gas towards the centers, as well as star formation or jet driven out- 
flows from the nuclear regions, are frequently observed and are directly or indirectly 
linked with the accretion processes onto super-massive black holes. 
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2 Extragalactic Nuclei 



The observational results obtained on the Galactic Center during the past few years 
need to be put into perspective. The nucleus of our Galaxy has an extremely low 
luminosity. However, it also demonstrates that violent activity may take place at 
times. Hence, a comparison between the center of the Milky Way and the nuclei of 
galaxies hosting Low Luminosity Active Galactic Nuclei (LLAGN) appears to be 
imperative. 

Footprints of AGN feeding and feedback in LLAGN can be observed in many 
cases [128-130]. The study of the content, distribution and kinematics of interstellar 
gas is a key to understanding the fueling of AGN and star formation activity in 
galaxy disks. Current mm-interferometers provide a sharp view of the distribution 
and kinematics of molecular gas in the circumnuclear disks of galaxies through 
extensive mapping of molecular line (mainly CO and to some extent high density 
tracers like HCN, HCO+, CS, etc.). The use of molecular tracers specific to the dense 
gas phase can probe the influence of feedback on the chemistry and energy balance 
in the interstellar medium of galaxies. Radiative and mechanical feedback are often 
used as a mechanism of self-regulation in galaxy evolution as well as thermal and 
non-thermal AGN activity. In the disks of galaxies the evolution is predominantly 
expressed in star formation and the evolution of stellar populations and the ISM 
properties. In galactic nuclei the thermal part is represented mainly by the properties 
of the nuclear ISM and by the NLR and BLR regions while the non-thermal part is 
dominated by synchrotron/SSC emission due to SMBH accretion and the presence 
of jets. 

High angular resolution (< FO observations with sub(mm)-interferometers (IRAM 
PdBI, ALMA) in the context of the NUclei of GAlaxies (NUGA) survey (see, e.g., 
[128, 129, 131-133] and upcoming more ALMA NUGA papers) allow us to study 
the mechanisms responsible for fueling AGN and star formation activity in the cen- 
tral R<1 kpc disks of a sample of 25 active galaxies at the 10-100 pc scale. This 
study has revealed streaming motions towards and away from the nuclear region that 
do not necessarily have to be co-phased with current AGN activity. In several cases 
these observations also reveal the presence of molecular circumnuclear disks and 
massive molecular outflows. 

These phenomena are directly coupled to black-hole fueling, feedback and duty 
cycles that are essential for understanding the activity in galactic nuclei [134-137], 
see also [138, 139]. The distribution of gas and stars in nearby galaxies traced by 3-D 
studies of molecular, neutral and ionized gas provides a unique view of the role of 
the multi-phase medium in triggering and fueling nuclear activity in galactic nuclei 
on scales ever closer to the central black hole. Radio- and mm-interferometers and 
in particular 3 -dimensional imaging devices in the infrared and optical domain (like 
integral field units) allow us to obtain spectroscopic and photometric information at 
every position of the field-of-view. Such observations have been used to elaborate 
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comparative studies of gaseous and stellar dynamics in active and quiescent galaxies. 
Studying the inner kiloparsec of AGN is particularly important, since the activity and 
dynamical time scales become comparable in this region. These investigations also 
show that the variability observed in nearby radio-quiet AGN may provide parallels 
to our own Galactic Center. They also provide evidence that AGN duty cycles may 
be shorter than previously thought. 



3 Black Hole Laboratory: New Instrumentation 
and Perspective 

An improved link between theory and observations will be provided by new and 
upcoming instrumentation such as the future capability of measuring polarization 
in the X-ray domain [140, 141], and direct VLB I imaging of the event horizon 
region for e.g. M87 as a representative of the largest supermassive black holes, and 
SgrA* as a representative for low-mass SMBHs [59, 60, 142]. The SKA (Square 
Kilometer Array) will be key instrument to establish the census of pulsars at the 
Galactic Center, to determine the star formation history, and to provide the required 
precision to determine the spin and quadrupole of the central BH, testing the theories 
of gravity (see, e.g., [143]). 

The beam combining instrument GRAVITY^ at the VLTI (RI. Frank Eisenhauer, 
MPE, Garching) will be able to measure the NIR image centroid paths during flux 
density excursions of SgrA* [12, 144-146]. These paths will depend on the geomet- 
rical structure and time evolution of the emitting region i.e. spot shape, e.g. presence 
of a torus or spiral-arm patterns, an emerging jet component. Figure 7 shows the 
fringe tracking spectrometer of GRAVITY being under construction at the Univer- 
sity of Cologne. Operating on six interferometric baselines, i.e. using all four UTs, 
the 2nd generation VLTI instrument GRAVITY will deliver narrow angle astrometry 
with 10|Jias accuracy at the infrared K-band. 

While most of the mentioned geometries are currently able to fit the observed 
variable emission from SgrA*, future NIR interferometry with GRAVITY at the VLTI 
will break some of the degeneracies between different emission models. GRAVITY 
will be able to detect the positional shift of the photo-center of a flare at the Galactic 
Center within the ~20 min orbital time scale of a source component close to the last 
stable orbit, using the flares as dynamical probes of the gravitational field around 
SgrA*. In particular GRAVITY observations of polarized NIR light could reveal a 
clear centroid track of bright spot(s) orbiting in the mid-plane of the accretion disk 
(see, e.g., [1-3]). A non-detection of centroid shifts may point at a multi-component 



^http://www.mpe.mpg.de/2441478/Team. 
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integrated laser blocking unit Wollaston electro-mechanical 
optics dummy with laser dichroic mechanism focus drive 




integrated optics double prism temperature camera with detector 
collimator mechanism control zoom lens mount 

, ~ 65 cm 



Fig. 7 The fringe tracking spectrometer of GRAVITY being under construction at the University 
of Cologne [144] 



model or spiral arms scenarios. However, a clear wander between alternating centroid 
positions during the flares will strongly support the idea of bright long-lived spots 
occasionally orbiting the central black hole. 
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Extreme Mass Ratio Inspirals: Perspectives 
for Their Detection 



Stanislav Babak, Jonathan R. Gair and Robert H. Cole 



Abstract In this article we consider prospects for detecting extreme mass ratio 
inspirals (EMRIs) using gravitational wave (GW) observations by a future space 
borne interferometric observatory eLIS A. We start with a description of EMRI for- 
mation channels. Different formation scenarios lead to variations in the expected 
event rate and predict different distributions of the orbital parameters when the GW 
signal enters the eLISA sensitivity band. Then we will briefly overview the available 
theoretical models describing the GW signal from EMRIs and describe proposed 
methods for their detection. 



1 Introduction 

Extreme mass ratio inspirals (EMRIs) arise following the capture of a small compact 
object (CO) — a white dwarf, neutron star or stellar mass black hole — by a massive 
black hole (MBH) in the centre of a galaxy. The astrophysical processes that lead 
to the formation of EMRIs are described in detail in Sect. 2. The inspiralling CO 
loses energy and angular momentum through emission of gravitational radiation, 
and the initially wide and very eccentric orbit gradually shrinks and becomes more 
circular. EMRIs are among the most interesting gravitational wave (GW) sources 
that could be observed by the proposed eLISA detector. eLISA (evolving Laser 
Space Interferometer Antenna) is a space-based gravitational wave detector which 
is scheduled for launch in 2034. It will be sensitive to GWs in the frequency range 
0.1-100 mHz. Sources in this band include the mergers of massive black hole bina- 
ries, which will be observable up to a redshift z = 20, and numerous white dwarf 
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binaries in the Milky Way, in addition to EMRIs. We will discuss the event rate and 
the expected precision of parameter estimation for EMRI sources in Sect. 2. 

During an EMRI, the CO typically spends 10^-10^ orbital cycles in the eLISA 
band before plunging into the central MBH. We need to model the phase of GW 
signal from EMRIs with an accuracy of a fraction of a cycle in order to detect the 
signal and correctly extract the parameters of the binary system. This is a challenging 
problem, which has not yet been solved in full. Due to the extreme mass ratio, 
m/M 10“^ — 10“^, we can treat the problem perturbatively, considering the field 
of the CO and the emitted GWs as a small perturbation of the background spacetime 
of the central MBH. At the leading orders in mass ratio the internal structure of a CO 
is not important and so the CO is conventionally treated as a delta-f unction. As often 
happens in such an approximation, the self-field is divergent at the position of the 
CO, and requires proper treatment (regularization) [1]. The resulting perturbation 
has the form of a tail expression, and depends on an integral over the entire past 
history of the CO’s trajectory. In the limit that the mass ratio goes to zero, the motion 
is described by a geodesic. However, the mass of a CO is small but not zero and due 
to interaction of the self field of the particle with a background, the trajectory slowly 
deviates from a geodesic path [2]. This can be described effectively as the action 
of a force (self-force) on the inspiraling object. In practice, the geodesic trajectory 
is used to compute the tail integral entering the self-force, and the resultant force 
is used to update the geodesic trajectory accordingly. In Sect. 3, we will summarize 
various ways to compute the GW signal from EMRIs and describe how the evolution 
of the orbital motion can be described using an osculating elements approach. The 
CO may also be spinning and this spin is coupled to the background curvature and 
alters the trajectory of the CO, forcing it to deviate from the corresponding geodesic 
of a non- spinning body. The trajectory of a spinning particle (in the limit of vanishing 
mass ratio) is described by the Mathisson-Papapetrou equation. Attaching a spin to 
a point particle is not uniquely defined, leaving a freedom to choose the dipole 
moment of a body (see [3] for a description of spinning objects in the weak field 
approximation). This freedom manifests itself through the need to specify a spin 
supplementary condition (SSC) in order to obtain a unique solution to the equations 
of motion. In order to understand these complications, we consider in Sect. 3.3 the 
motion of a spinning particle in de Sitter space time. This space time possesses a 
non-trivial curvature but is still fully symmetric. Eor more details on the computation 
of the self-force and on the Matthisson-Papetrou equations we refer to other articles 
in this issue. 

Last, but not least we want to consider the question of detectability of GW sig- 
nals from EMRIs. The GWs generated by an EMRI system are characterized by 14 
parameters: two masses m, M, the dimensionless spin of the MBH, a, and its ori- 
entation, 0k,4>k\ six parameters describing the CO’s position and velocity at some 
fiducial time or equivalently the instantaneous shape and phase of the orbit at that 
time (eccentricity e, inclination of the orbital plane to the spin of MBH l, semi- 
latus rectum p and the initial phases 4^9,^ corresponding to the three coordinate 
degrees of freedom); the sky location of the source, 0, 0, and its luminosity distance, 
Di. Many of these parameters are highly correlated. The GW signal comprises a 
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superposition of orbital harmonics, with the number of harmonics and their relative 
strength strongly dependent on the eccentricity and binary orientation. The strength 
of the signal observed in the detector varies with time as eLISA moves around the 
sun (amplitude modulation) and the relative motion of the detector and the source 
induces a time-dependent Doppler modulation of the phase. The main challenge in 
detecting EMRIs is the multi-modality of the likelihood. The likelihood can be seen 
as a hyper- surface embedded in the 14-dimensional parameter space. It has multiple 
strong maxima and the main challenge is to find the highest (global) maximum. In 
Sect. 4 we describe algorithms to do this which were successfully demonstrated on 
the Mock LISA data challenges [4]. 

Throughout this paper we use geometrical units G = c = 1. 



2 Astrophysics of Extreme Mass Ratio Inspirals 

In this section we will consider possible channels leading to EMRI formation, the 
expected number of EMRI events that will be observed for eLISA and the likely 
accuracy with which eLISA will constrain their parameters. Then we will briefly 
summarize some of the potential impact of EMRI detections for astrophysics and 
fundamental physics. 

2.1 Formation of EMRIs 

The “extreme mass ratio” refers to the fact that the mass of the CO is of order of 
1-10Mq, while the mass of the central (capturing) object is in the range 10^-10^ Mq . 
Current astrophysical observations indicate that massive compact objects of this kind 
are present in the nuclei of all sufficiently massive galaxies for which the central part 
can be resolved. The best example is the nucleus of the Milky Way, in which a few 
dozen bright 0-B stars (so called S-stars) have been observed in Keplerian orbits 
around a central object with an estimated mass of 4 x IO^Mq. In addition, the 
compactness of this object suggests that it must be a massive black hole. 

These massive objects in the centres of galaxies are typically surrounded by clus- 
ters of stars. In the “standard” picture of EMRI formation, the stars are spherically 
distributed around the MBH (which should be approximately true for sufficiently 
large distances) and dense enough for efficient 2-body relaxation, i.e., mutual grav- 
itational deflection and contact collisions. The timescale for this process, the relax- 
ation time trix , is defined as the time required to change the angular momentum of a 
star by an amount Jc, where Jc is the angular momentum of a star on a circular orbit 
with the same semi-major axis. A smaller trix implies that stars can be more easily 
deflected on to very eccentric orbits with a small periapsis passage. If a CO object 
on an initially wide orbit is perturbed onto such a trajectory, it will lose energy to 
GW bursts emitted near periapsis (vp) and its orbit will gradually shrink. While the 
semi-major axis is very large, the CO can still efficiently interact with other stars 
at the apoapsis and could be either deflected onto a plunging orbit with Vp < 8M 
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or onto a wide orbit which does not emit appreciable GW radiation. To become an 
observable EMRI, the CO must remain on the highly eccentric orbit until its period 
becomes smaller than "^10^-10"^ s, at which point it is continually radiating GWs 
in the eLISA sensitivity band. While we will be primarily interested in such EMRIs 
here, the bursts of GWs produced during periapsis passages in the early stages of the 
process could also be potentially detected by eLISA if the event is in the nucleus of 
nearby galaxies [5]. 

When the stars interact gravitationally, they tend to divide the kinetic energy 
equally and, while equipartition is not reached in practice, this process causes more 
massive objects to sink deeper in the potential well of the MBH. This process is 
called mass segregation. As a result we expect stellar mass black holes to form a 
steep power-law density cusp around the MBH n(r) ~ r~^ with a 1.7 — 2.0, 
which dominates for r < O.lpc. The lighter stellar species form shallower density 
profiles with a ~ 1.3 — 1.5 [6]. The relaxation time is inverse proportional to the 
density of the CO and it should therefore be smaller for the stellar mass black holes. 

In order for an object to become an EMRI, it should efficiently dissipate energy 
through GW emission, and have a sufficiently low probability to be deflected onto 
a different orbit. This condition implies that the time scale for orbital decay by GW 
emission, tcw^ should be smaller than (1 — e)trix, where e is orbital eccentricity. 
Once the orbital period reaches P < lO'^s, the CO completely decouples from the 
cusp, which happens for orbits with semi-major axis uemri "" 0.05pc. 

Eor typical orbits around an MBH, the number of stars enclosed by the orbit is 
rather small and so the gravitational potential created by the “field” stars is not a 
smooth symmetric function. This gives rise to a torque acting on a CO on an orbit 
with semi-major axis of r ~ \fNm^/aco^ where A is a number of field stars 
with mass inside the CO orbit. If the precession of the CO orbit is slow compared 
to the timescale over which the distribution of field stars changes significantly, the 
CO experiences a nearly constant torque over some time. This mechanism, known 
as resonant relaxation, changes the angular momentum of the CO, but not its energy. 
The characteristic time scale associated with resonant relaxation, tRR, is significantly 
smaller than trix and so this process can significantly boost the EMRI event rate. 
Resonant relaxation plays an important role for orbits with aco S clemri [2-10]. 
However, for COs on eccentric orbits with small perhaps radii, the relativistic (GR) 
precession can be very high, which effectively destroys the resonant relaxation effect. 
The point at which this occurs is known as the “Schwarzschild barrier” [11]. The 
existence of this barrier means that resonant relaxation is not as effective at boosting 
EMRI rates as one might first think, although if the MBH has significant spin then the 
impact of the “Schwarzschild barrier” is somewhat diminished due to the lower value 
of the plunge periapsis for prograde orbits [ 12] . In this case, COs that would normally 
be considered as plunging and hence undetectable around a Schwarzschild MBH 
actually perform many cycles in the eLISA band and may contribute significantly to 
the event rate [13]. 

In this picture, the critical thing for having a high EMRI rate is to have com- 
pact objects in the “loss-cone” (orbits with impact parameter sufficiently small that 
they can be captured or tidally disrupted by the MBH). Several channels have been 
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suggested that can replenish the loss-cone and thereby significantly boost the EMRI 
rate, including triaxiality of the potential (non- spherical galactic nuclei) [14] or the 
presence of massive perturbers (such as intermediate mass BHs, and/or molecular 
clouds) in the vicinity of the orbits [15]. 

The complex dynamics of this standard capture scenario for EMRI formation 
means that the astrophysical event rates are very uncertain. To estimate event rates we 
will use a current best guess of 400Gyr“^ for Milky Way-like black holes, dominated 
by EMRIs in which the CO is a black hole. This rate is taken from [16]. 

As well as this standard mechanism for EMRI formation, there are two other 
plausible channels. 

Tidal binary disruption. It is possible that within the radius of influence of a MBH 
there is a binary fraction of at least a few percent [17]. If a binary approaches the 
MBH it can be tidally disrupted and, if this happens, one star is ejected at very high 
velocity while the other star becomes tightly bound to the MBH. The captured CO 
is expected to end up on an orbit with a semi-major axis of a few hundred AU and 
a pericentre distance of a few to tens of AU, implying that it will circularise by the 
time it enters the eLISA frequency band [17, 18]. This is a distinct feature of this 
formation channel, since in the standard scenario we expect the EMRIs to have a 
significant residual eccentricity even at plunge [19], 0.1 — 0.3. We observe in 

the Milky Way so called “hyper- velocity” stars [20]. which are moving away from 
the galactic centre with large velocities. The best current explanation for the presence 
of short-lived S- stars in the vicinity of the Milky Way MBH is that they came there 
following the tidal disruption of binaries, while the observed hyper- velocity stars are 
the thrown away companions [21]. 

Formation of stellar remnants in a disk. Observation of active galactic nuclei 
suggests the presence of a circum-nuclear gaseous disk accreting onto the MBH. If 
the disk is thick and sufficiently massive, the outer part could fragment and form 
stars. If migration through the disk is sufficiently slow, stars formed in this way 
could evolve to form compact object remnants (neutron star or black hole) which 
subsequently spiral into the MBH as an EMRI in the equatorial plane (the accretion 
disk, at least its inner parts, is expected to be aligned with the MBH’s equatorial 
plane [22]). The interaction with the gas is also likely to keep the orbit of a CO close 
to circular, so the distinct feature of this channel of EMRI formation is a circular 
orbit in the black hole equatorial plane. 

By measuring the orbital parameters we will be able to say which of these three 
channels provides the most likely explanation for how the EMRI was formed. Eor 
more details on the dynamics of galactic nuclei we refer the reader to the compre- 
hensive review [23]. 



2.2 Expected Event Rate Estimation 



In this section we follow [24] and briefly outline how the expected event rate of 
EMRIs observed by eLISA can be estimated. To make this estimation we require 
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an intrinsic event rate 1Z(M, a, ji), where M is the mass of the MBH, a its spin in 
units of M and fi = m/M the mass ratio. The intrinsic event rate tells us how 
often EMRIs are formed (i.e., how often they enter the eLISA sensitivity band) per 
galaxy hosting a MBH with parameters M, a. The mass ratio parameter tells us the 
nature of a CO, i.e., whether it is a stellar mass BH, neutron star or white dwarf. 
As discussed in the previous subsection, due to mass segregation we expect stellar 
mass BHs to be the most likely candidate for EMRIs, so we choose a canonical value 
for the CO mass of m = IOMq. We will normalize the mass of the MBH by the 
mass expected for a Milky Way type galaxy Mmw 3 x IO^Mq. So far we do not 
have information about the distribution of the spin of MBHs of this mass. X-ray 
observations of some active galactic nuclei provide information about the spin of 
accreting MBHs in the centre, but those black holes are of higher mass > 10 ^ Mq 
and embedded in the gaseous circum-nuclear disk. In addition all present estimations 
of the spin are heavily model dependent and could vary significantly depending on 
the underlying assumptions [25]. Therefore, here we assume a uniform distribution 
of the spin within its physical range a G (—1,1). The estimation of the intrinsic 
event rate is a very challenging task, as described above and in more detail in [23], 
which depends quite heavily on the underlying assumptions about the efficiency of 
mass segregation, the relative importance of different EMRI formation channels and 
the interplay between resonant relaxation and the “Schwarzschild barrier”. Here we 
adopt the estimate derived in [16] which for stellar mass BHs is 



where (3 ~ 0.19. 

If the duration of EMRI signals was significantly shorter than the observation 
time, then the observed event rate would be determined by computing the distance 
at which the signal-to-noise ratio (SNR) equals some detection threshold pthr and 
then multiplying the rate per unit volume by the volume contained by that distance, 
assuming a uniform distribution of EMRIs in the local Universe. However, EMRIs are 
long-lived, and the SNR can be accumulated for as much of the inspiral as coincides 
with the eLISA observation. Eixing all the parameters of the EMRI, we can compute 
the SNR as a function of the time left to plunge, tpi. As we increase tpi from zero, 
the SNR first increases, then reaches a maximum before starting to decrease. There 
is a decrease of SNR for large tpi because the finite observation time means that we 
are ultimately only observing systems that are rather wide, with not very efficient 
GW emission, and with emission primarily at low frequencies where acceleration 
noise rises rather steeply. This means that if an EMRI is at all detectable, the SNR 
as a function of tpi intersects the line SNR = pthr at two times, t early, Uate, and we 
can define the EMRI observable lifetime as r(A/) = tiatei.\) ~ teariy{\), where A/ 
corresponds to all the other parameters of the EMRI (besides tpi) which we have fixed. 
If EMRIs plunge at a rate IZ per year in a particular galaxy, then rlZ gives the expected 
number of events from that galaxy (after appropriate averaging over parameters A/). 




( 1 ) 
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Among all parameters describing the EMRI system the most important are M, a, 
and we denote the remaining parameters as A/. We define M{M, a, z)dMda as 
the number of MBHs per comoving volume with mass M e [M, M + JM], spin 
a G [a, a + da], and at redshift z. We make two further assumptions (i) that the 
mass and spin distributions are independent; and (ii) that the distribution of MBH 
mass and spin are independent of redshift. The first assumption reflects our level of 
ignorance, and the second assumption is reasonable given how far we can observe 
EMRIs (with eLISA we will able to see EMRIs up to Zmax ~ 0.7). In this range 
we can ignore the evolution of masses and spins with z. Under these assumptions 
M{M, a, z)dMda = (dn/d\n M)(M)d\n Mp(a)da, where p{a) is the probability 
distribution function for the spin J p(a)da = 1. As described above, we assume this 
is uniform in our calculations, but we keep it here in the equation for completeness. 

The expected event rate is then 



N eLISA = 



POO rMhigh 

/ dz dlnM da1Z(M, o)r(M, a, z, Xi) 

J Z — -0 J aiiqid j 1 



' z= 

dn dVc 

dlnM dz 



( 2 ) 



A/ 



Here {dVc/dz)dz is the comoving volume in the redshift range [z, z + dz\. The tri- 
angular brackets denote the averaging over other EMRI parameters A/ . We note that 
in practice the intrinsic event rate could also depend on some parameters from the set 
\i (depending on the channel of EMRI formation). The mass function {dnjd In M) 
can be deduced from measured galaxy luminosity functions using the observed 
L — a, M — a correlations. In the range of interest to eLISA, this functions approx- 
imately fiat [26], so we adopt 



dn _ / M Y 

dlnM ~ \3 X lO^M© / 



( 3 ) 



With canonical values no = 0.002 Mpc a = 0. If we assume these canonical 
values, with [3 = 0.19, a mission duration of 2 years and a detection threshold 
of pthr = 20, we estimate that eLISA would observe 25-50 events in two years 
[27, 28]. This spread in the predicted number of events comes from uncertainties in 
the waveform model and system parameters, but a much larger uncertainty, which is 
not taken into account here, arises from the uncertainty in the true value of IZ. 



2.3 Science Return from Observing GW Signals from EMRIs 

Detection of EMRIs and measurement of their parameters provides unique astro- 
physical data which cannot be obtained by any other means. We expect to be able 
to learn information about stellar populations in the centre of the Milky Way in the 
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future by observing pulsars in the nuclear stellar cluster region using the SKA [29]. 
Inferring similar properties of other galaxies through observations of EMRIs will 
also us to compare the nucleus of the Milky Way with nuclei of other galaxies. The 
number of observed EMRI events and the mass distribution of the COs will tell us 
about the physics of mass segregation, the masses and spins of stellar mass compact 
objects and about the steepness of the stellar cusps in the centres of galaxies. In 
addition, EMRI observations will provide precise measurements of MBH masses 
and spins in a new mass range. EMRIs will probe galaxies containing black holes 
with masses IO^-IO^Mq, and such galaxies tend to be of lower mass and not partic- 
ularly luminous in the electromagnetic spectrum. Extracting information about the 
nuclei of those galaxies is therefore very challenging, if not impossible, using elec- 
tromagnetic observations and eLISA therefore has tremendous potential to inform 
us about these systems. Observations show that the masses of black holes in galactic 
nuclei correlate with the mass, luminosity and the stellar velocity dispersion of their 
host galaxy [30]. These correlations imply that black holes evolve along with their 
hosts throughout cosmic time, but it is not yet known if this coevolution extends 
down to the lowest galaxy and black hole masses, since those systems may have 
differences in the accretion properties [31], dynamical effects [32], or cosmic bias 
[33]. eLISA observations of EMRIs will significantly improve our knowledge of the 
MBH mass function (e.g., inferring the parameter a in eqn. [3]), as well as allowing 
us to measure the intrinsic event rate (for example constraining the parameter (3 in 
eqn. [1]), determine the relative importance of different channels of EMRI formation 
and measure the spatial distribution (relative to the MBH) of different types of CO. 
This is made possible by the ultra-precise determination of EMRI parameters with 
GW observations. In Eig. 1 we show how accurately we expect to measure the most 
important parameters: MBH mass (M) and spin {a), CO mass (m), orbital eccentric- 
ity just before the plunge (end of inspiral) (Cpi). The last parameter, A 2, is a possible 
deviation in the MBH quadrupole moment away from the Kerr value, which will be 
discussed at the end of this subsection. 

In addition an observation of an EMRI will allow us to determine the luminosity 
distance to the source (Dl) with an accuracy of < 1% and to localize the source on 
the sky to about 0.2 square degrees. Such a fantastic accuracy is achieved because 
the source is long lived — the CO spends cycles in the close vicinity of a 

MBH. Using matched filtering we will be able to determine the phase of an EMRI 
to an accuracy of half a cycle, a fractional phase accuracy of All infor- 

mation about the binary system is encoded in the GW phase and so we can expect 
to make measurements of the intrinsic parameters to this same fractional accuracy. 
Measurements of the extrinsic parameters, such as sky localisation, are not as precise 
since these measurements come not from the phase but from the modulation of the 
GW signal (in amplitude and in phase) caused by eLISA s orbital motion. 

These precise phase measurements mean we can also use EMRIs to test the 
“no-hair” theorem: if the central massive compact object is indeed described by 
the Kerr metric, as general relativity predicts. The spacetime outside a stationary, 
axisymmetric object is fully determined by its mass, M/, and current. Si, multipole 
moments. Since these moments fully characterise the spacetime, the orbits of the 
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Fig. 1 Expected precision of parameter estimation from observed EMRI events, computed using 
the Eisher information matrix: MBH mass (M, dashed line), CO mass (m, solid line), MBH spin 
(a), orbital eccentricity before plunge (epi) and deviation of the MBH quadrupole moment from 
the Kerr value (Q) 



smaller object and the gravitational waves it emits are determined by these multipole 
moments. The emitted GWs therefore encode a map of the spacetime structure and 
by observing these gravitational waves with eLISA we can precisely characterise 
the multipole structure of the central object. Extracting the moments from the EMRI 
waves is analogous to geodesy. If the central object is a Kerr black hole, then all 
multipole moments are determined by its mass and spin (“no-hair” theorem): 

Ml + i Si = 

If we can measure the first three moments we can therefore check whether the central 
object is consistent with being a Kerr black hole. Eigure 1 shows that we should be 
able to measure a deviation in the mass quadrupole moment from the Kerr value, Q = 
\M 2 — to a precision of 6Q ^ (10~^ — 10~^)M^. EMRIs could therefore 

also serve as laboratories for testing fundamental physics. Eor more discussion on 
this topic we refer to [28, 34]. 
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3 Modelling the GW Signal from EMRIs 

For detection of EMRIs we will utilize matched filtering, this technique assumes that 
we can model the GW signal and then cross-correlate it with the data. The EMRI 
signal depends on 14 parameters (actually on 17 if we take into account the spin 
of CO), which we do not know a priori and need to infer from the measured data. 
To do this, we must generate many signals from a given model (templates) across 
the full, 14-D, parameter space to find the parameters that best fit the data (this 
set of parameters, which maximize the likelihood, are called “maximum likelihood 
estimators” of those parameters). We will describe the search procedure in detail in 
the next section. 

The presence of noise in the data stream causes the best-fit parameters to differ 
from the true parameters of the GW signal. The size of this difference can be estimated 
using the Fisher information matrix, as shown in Fig. 1. If the data is analysed using 
an inaccurate model there will also by systematic errors in the parameter estimates, 
which could be larger than the statistical errors from detector noise. It is therefore 
important to accurately model the GW signal coming from EMRIs, to ensure reliable 
estimation of parameters and improve the detectability, since a mismatch between 
the signal and template will cause a drop in the SNR and decrease in the observed 
volume by (SNR/SNRoptimal)^- 

In this section we will describe currently available models for EMRI signal and 
discuss their effectualness (if they are able to recover the optimal SNR) and faithful- 
ness (if the systematic errors in parameter estimation is below the statistical errors 
due to presence of the noise). Note that effectualness does not imply faithfulness: a 
model could recover a significant fraction of the SNR with a large systematic bias in 
the parameters. In other words, the shift in the parameters from the true values could 
(partially) compensate for inaccuracies in the model. 



3.1 Waveform Inventory 

Unlike the inspiral of a comparable mass binary, the merger (here we call it plunge) 
of a CO with MBH and subsequent ring-down are suppressed by a factor of the 
mass-ratio and are therefore not observable by eLISA. We therefore only need to 
model the inspiral part of the signal up to a plunge. However, for the whole of the 
inspiral observable by eLISA, the CO is orbiting in the strong-held region close to 
the MBH and moving at ultra-relativistic speeds. This makes modelling an EMRI 
signal somewhat different from modelling GWs from a binary of two nearly-equal 
mass MBHs. Here we briehy outline some of the currently available models for GW 
signals from EMRIs. More detailed description can be found in other papers in this 
volume. 

Post-newtonian expansion. The post-newtonian approach describes the GW signal 
as an expansion in velocity v. As mentioned above the CO in EMRI systems are fast 
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moving and spend 10"^- 10^ cycles in a regime where v is large. The EOB approach 
[35, 36] is the most suitable for modelling EMRIs by construction (the conservative 
dynamics reduces to the test-mass in the limit m/M ^ 0), however the dissipative 
part (fluxes) are needed to a very high post-newtonian order, which is not currently 
known. In addition, the analytic expressions for the fluxes are known only for nearly 
circular and nearly equatorial orbits, while we expect EMRI orbits to be both eccentric 
and inclined [37]. 

Analytic ''kludge” waveforms. This model was introduced primarily to study 
detection rates and parameter estimation for EMRIs [38]. The main advantage of 
these waveforms is that they are fast to generate, so they are suitable for large Monte- 
Carlo simulations, and they were extensively used to develop detection algorithms 
(see Sect. 4). This model is an extension of the work by Peters and Mathew [39], 
it represents emission from a CO in Keplerian orbit augmented by imposing (post- 
newtonian) relativistic precession of the orbital plane and the direction to perihelion. 
The dissipative evolution is taken from post-newtonian calculations. This model is 
not particularly accurate but it captures the main physical processes occurring in 
EMRIs. 

Numerical "kludge ” waveform, or semi-relativistic model. The idea of the numer- 
ical kludge waveforms is to combine an exact particle trajectory (up to inaccuracies 
in the phase space trajectory and conservative radiation reaction terms) with an 
approximate expression for the GW emission. By including the particle dynamics 
accurately, we hope to capture the main features of the waveform, even if we are 
using an approximation for the waveform construction. The idea was introduced in 
[40, 41] and was further evolved with some modifications in [42, 43]. 

The procedure to compute a numerical kludge waveform has two stages. Eirstly, 
a phase-space inspiral trajectory is constructed, i.e., the sequence of geodesics that 
an inspiral passes through, by integrating prescriptions for the evolution of the six 
constants of the motion (energy, angular momentum. Carter constant and three ini- 
tial phases). Initial work has used post-newtonian expressions (augmented by some 
consistency corrections and by fitting to solutions of the Teukolsky equations) to 
evolve these constants. This inspiralling trajectory is computed numerically thus the 
name “numerical kludge”. Once the trajectory has been constructed a waveform is 
generated by identifying the Boyer Lindquist coordinates along the trajectory with 
spherical-polar coordinates in a fiat space time and applying weak-field GW emission 
formulae, in particular the quadrupole-octupole approximation: 

where are the mass quadrupole and octupole moments and is the 

current quadrupole moment of the binary system, is a unit vector pointing from 
MBH to the position of a CO, and overdots denote time derivatives. These waveforms 
are somewhat slower to generate as compared with the analytic kludge due to the 
numerical integration of the orbital trajectory, but it is far more faithful up to the last 
month or less (semilatus rectum p ~ 6M) before the plunge. 
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Adiabatic inspirals based on Teukolsky formalism. The very first framework for 
black hole perturbation theory in a Kerr background was the Teukolsky formalism 
[44], which encapsulates all gravitational radiative degrees of freedom in a single 
“master” wave equation (the “Teukolsky equation”) for the Weyl scalars, and 4 ^ 4 . 
A key feature of this equation is that it admits separation of variables in the frequency 
domain, which effectively reduces it to a pair of ordinary differential equations. The 
Teukolsky equation has been solved in the frequency domain [45] and in the time 
domain [46], but both approaches assume the orbit that acts as the source of the 
perturbation is a geodesic. The rate of change in energy, angular momentum. Carter 
constant (averaged over several orbits) are evaluated from the gravitational wave field 
and then used to update the parameters of the geodesic in an adiabatic manner. This 
procedure misses the evolution of the other constants of motion (initial positions) as 
well as making the adiabatic assumption. As a result, the waveforms are not accurate 
on a very long time scale, but they are the most faithful model on time scales M^/m . 

Self -force waveforms. An accurate description of the self-force and its derivation 
is given in other papers in this volume, so we only briefly mention it here. As men- 
tioned above, the extreme mass ratio in an EMRI system allows the waveform to 
be determined using perturbation theory. The inspiralling object can be regarded as 
a small perturbation on the background spacetime of the central black hole, except 
very close to the small object. In the vicinity of the small object, the spacetime can 
be regarded as a Schwarzschild BH moving under the influence of an external tidal 
field due to the MBH. Matching these two regimes allows one to obtain an expression 
for the self-force acting on the CO. The self force can be seen to arise as a result 
of the interaction of the self field of the CO with the non-fiat background geometry, 
which causes the lines of force to be bent and act back on the CO. The self-force 
can be conventionally split into two parts: non-time symmetric (dissipative) and 
time-symmetric (conservative). The former part causes the inspiral and dominates 
while the latter part can be eliminated by a redefinition of the orbital frequencies at 
each instance, which means it is effectively second-order in mass ratio. The adia- 
batic Teukolsky based waveforms take into account only the dissipative part of the 
self force, neglecting the conservative part, which defines the domain of its validity. 
The self force is computed assuming the CO is moving on a geodesic, then it is 
used to adjust the geodesic (inspiral) before the self-force is recomputed again. The 
computation of the self force is somewhat complicated as it treats the CO as a delta 
function in the background spacetime, which requires mathematical apparatus for 
regularization of some divergent integrals. It is possible to subtract the singular part 
from the field equations (by finding a singular solution valid in the vicinity of the 
CO) and the resulting equations are manifestly regular and contain on the right hand 
side a smooth effective source [47], which allows the field equations to be coupled 
to the equations of motion and integrated. This procedure can be written for a scalar 
field (representing a CO carrying a scalar charge and ignoring the gravitational part 
of the self-force) as [48] 



(O0;a’^ = 5(x;z(r),i.(r)) 



( 5 ) 
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Du^ 

dr 



+ V/3 «>'■ 

m(r) 



dm 

dr 



-qul^ V/3 



( 6 ) 

(7) 



where O is the scalar field, q is the scalar charge, m the mass of the CO, S is an 
effective source term and is the CO four- velocity. Greek indices are being used 
to indicate space-time components, and a semicolon denotes a covariant derivative 
with respect to the background spacetime, ga[ 3 - A similar procedure can be applied 
to the gravitational field. So far only the self-force waveform for a Schwarzschild 
background has been computed, but recent progress has been rapid and so we expect 
the extension to Kerr to be completed within a few years. 

Numerical relativity waveforms. The ultimate goal would be to compute EMRI 
waveforms using numerical integration of the full GR field equations. State-of-the-art 
techniques have enabled the computation of waveforms for the last 20-50 cycles of 
the inspiral, merger and ring-down of comparable mass ratio binaries. The simulation 
of an EMRI requires the computation of a few orders of magnitude more cycles, plus 
the resolution of two very different spatial scales. This is far beyond the capability of 
current computational resources and techniques. In addition, the time step for explicit 
numerical integration is set by the smallest characteristic scale in the problem, which 
is the mass of the CO in this case. Numerical waveforms will be very useful for 
the calibration of current calculations based on perturbation techniques, but new 
numerical methods will have to be developed to handle EMRIs. 



3.2 Evolving Perturbed Geodesic Motion 



In this subsection we will focus on how we can compute the evolution of the orbit. 
The orbital evolution is the key ingredient for creating numerical kludge waveforms 
and waveforms based on the self force. In fact this is the same problem, the main 
difference is in how the waveform is computed from the orbital trajectory. To compute 
the orbital evolution we must solve the forced geodesic equation: 

= a^, ( 8 ) 

where a^ is the 4- acceleration. The acceleration is essentially the self-force, but the 
method we will describe here for solving this equation is also applicable to the case 
where a^ represents some other kind of external perturbation. This perturbation could 
be caused by a second (intermediate) MBH (if the CO in the EMRI is inspiralling 
into an MBH that is in a wide MBH binary), a molecular cloud or disc, another star 
or compact objector basically anything that can cause a slow modification of the 
geodesic orbit. Here we assume that the acceleration has been derived in some other 
way and are only interested in the effect it has on the inspiral trajectory. 
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The rest of this subsection summarizes results described in more detail in [49] . 
We use an osculating elements approach to evolve Eq. (8). If the perturbing force is 
small, ^ we can represent the perturbed trajectory at each instant by the unique geo- 
desic passing through the same position with the same velocity and see the orbital 
evolution as a slow variation of the constants of these instantaneously-tangent geodes- 
ics. A general geodesic in Kerr spacetime is described by eight constants of motion: 
J = {m, E, L^, Q, XO, to}, however two of them (CO mass and initial time 
m, to) are not truly dynamical, so we will work with the remaining 6: orbital energy 
(E), orbital angular momentum projected onto the spin of the MBH (/^), Carter con- 
stant (Q) and three initial phases (i/jo, Xo, M describing the initial position of the 
CO on the orbit in r, 0 and 0 respectively. The osculating element for of the equation 
of motion r = fgeo + Sf, is 



z“(r) = z“(7^(r),r), 



dz^ 

dr 



dz^ 

dr 



dJ^ dr 



o 

II 

5 


(9) 


- = sr. 


(10) 



The first set of equations describes a “geodesic” motion with slowly changing orbital 
“constants”, and the second set gives us the evolution of the orbital “constants” as a 
function of the perturbing force. 

The advantage of using the osculating elements approach is that we can use an 
adiabatic approximation (or, more generally, a two-time-scale expansion [50]) to 
evolve EMRIs, for which the radiation reaction time scale is much longer than the 
orbital time scale, allowing us to more easily study secular effects. 

The osculating elements approach was first used in [5 1] to study Eq. 8 in Schwarz- 
schild background, and was extended to Kerr in [49] . The authors in [49] wrote the 
osculating element equations on two different forms, using the Kinnersley tetrad or 
“Hughes” variables (i.e., in terms of the orbital constants and the total phase vari- 
ables [45]). In both cases, the appearance of an apparent divergence in the osculating 
equations of motion at turning points is avoided. The techniques were applied to a 
toy problem in which an EMRI was evolving under the influence of a perturbing 
force due to drag from surrounding material. This “gas-drag” force was taken to 
be proportional to the velocity of the inspiralling compact object. The two different 
approaches were shown to give identical results, and the comparison of the exact and 
adiabatic solutions to the problem identified the domain of validity of the adiabatic 
approach. Although the gas-drag problem was considered only to illustrate the meth- 
ods, it yielded interesting results. In particular, it was found that the influence of the 
drag force was to drive the inspiral of the object, but also to increase the eccentricity 



Tn fact this formalism does not assume the force is small — there is a unique geodesic passing 
through any given point with a particular velocity and so any trajectory can be described as an 
osculating geodesic. However, the approach is most useful when the force is small since then the 
trajectory remains almost geodesic and parameterising it in terms of instantaneous geodesic motion 
is useful. 
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of the orbit and decrease the orbital inclination. A gravitational wave driven inspiral 
would tend to show a decreasing eccentricity and so these two types of perturbing 
force would be distinguishable in an EMRI observation. 

Osculating elements were also used to generate inspirals in a Schwarzschild back- 
ground under the influence of the gravitational self-force in [52]. The formalism 
developed for Kerr inspirals in [49] has not been used for any other studies so far, but 
this will be done once suitable models for perturbing forces are available. Another 
type of orbital perturbation, which can also be interpreted in terms of a perturbing 
force acting on a geodesic, is the influence of the spin of the CO on the trajectory. 
This will be discussed in detail in the next subsection. 



3.3 Spinning Particle in de Sitter Space-Time 

In this subsection we will consider a spinning CO. There are several contributions to 
this proceedings which describe the motion of a spinning body in a given background 
in great detail. Here we will give only a brief summary, then show how we can 
formulate the motion in terms of the osculating elements approach described in the 
last subsection. To understand the motion of a spinning CO in the MBH spacetime, 
we will first consider a simpler problem. We will describe analytically the motion of 
a spinning test body in de Sitter spacetime. 

The motion of a test mass in an arbitrary spacetime is governed by the Mathis son- 
Papapetrou equations 



( 11 ) 

DrS°^f^ = ( 12 ) 

The first complication is that the 4-momentum and 4- velocity are not parallel 

= mu^ -\- . (13) 

Here denotes a covariant derivative with respect to the proper time, square brack- 
ets denote the anti- symmetric part, is the Riemann tensor of the background 

space time and is the spin tensor. The difference between p^ and 

means that there is an ambiguity in what we call the mass — we can define this as 
m = p^Ua or = p^pa- The second complication is that there is not a sufficient 
number of equations to determine all of the unknowns. In order to close the system 
we need to introduce an additional “spin supplementary condition” (SSC). There is 
an arbitrariness in choosing the SSC, which is usually attributed to how we choose 
the representative word line of a test mass (this is equivalent to choosing a dipole 
moment of a spinning CO). The main reason that the SSC is needed is that there is 
an ambiguity in the definition of the spin tensor for a point mass. The point mass 
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is an approximation of an extended body (for which the spin tensor is well defined) 
when the size is much less than the radius of curvature of the background spacetime. 
The most common SSCs are 

O') = 0, (ii) UaS^^ = 0, (in) WaS^^ = 0 (14) 

SSC (i) is usually referred to as the Tulczyjew condition [53], (ii) is the Frenkel- 
Pirani condition [54, 55] and (iii) was first introduced in [56] and is referred to as 
the u; -condition. 

As mentioned above we want to write the Mathisson-Pappetrou equations as a set 
of first order equations using the osculating elements approach. To achieve this, we 
must first write the equations of motion in the form of a forced geodesic equation 
for a non-spinning particle: 



^dxP dx^ 

L Y « — 

ds^ ds ds 



(15) 



which we want to rewrite later in the form (10). We denote the SSCs (i), (ii) and (iii) 
as “T” and “F” and “w”, and consider first the “T” condition, ph = 0. In that 
case, we have M = const, but rh(u^,u^) = S^^UaUf]. We can introduce a new 
time variable. A, with dA = mdr and use to denote the coordinate velocity in the 
new coordinates := dx^/dX = jm. The equations then become 



+ Tp^^S^^uP = 
dA 

dA m2 y 4M2 + SPl^SP^Rp/Spa ) ’ 



(16) 



which now have no explicit dependence on m and so we can proceed to write them 
in osculating element form. In particular, we can differentiate the third equation with 
respect to A and then use the first equation to get an equation for ^ -h Tpcp^u^u'^ 
that depends only on position and velocity, and not on derivatives of u^. The explicit 
expression for the covariant total derivative of is given by: 



~dX 



= m^luP+ uis j + Tpa'^uPu'^ 

- (i” + 

- Is-”./ V/ (h% - • 



( 17 ) 
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where we made use of the following abbreviation: 

' 4M‘^ + M^S'^^S'^’'Rnuex' 



The third equation in (16) gives an implicit dependence of on the spin tensor and 
velocity (p^ = p^{u^ , S^^)) which we can use to integrate the (second) equation 
for the spin tensor. 

We note, however, that the standard osculating element formulation of the equa- 
tions implicitly imposes the condition that u^Ua = 1 and hence fa = 0. This is no 
longer true after this change of variables. However, there is a way to put the equations 
into this standard osculating element form when there is an arbitrary force on the 
right hand side. To tackle this problem we can again make a change of integration 
variable to a new variable, q say. We then have 



&q 

dA dA &q 

dX^ \dA/ dq^ dA^ dq 
and the equations become 

d^x^ dx^ dx^ 1 / d^q dx^\ 

_l_ p a rfa I ^ I 

dA^ dA dA (d^/dA)^ \ dA^ d^ / 

We can impose the orthogonality condition be solving 



(18) 



(19) 



_ fgdx°'/dq 

dA2 ga/3(dx°‘/dq){dx^/dq) 



and the force becomes 



/ 



fa 



1 / .a f^dx'^/dq dx“ \ 

(d^/dA)2 (■' gn^(dxP/dq)(dx‘'/dq) dq ) 



( 20 ) 



( 21 ) 



So, to compute the new force we need to know the value of dq/dX. We can set this 
to one initially and then simultaneously integrate the equation 



dq \dA/ dq/dX g^y{dx^ /dq){dx^ /dq) 

This is a somewhat complicated procedure, but the right hand sides of the new 
equations now do not depend on derivatives of velocity and so the problems identified 
above no longer apply. 
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The (hi) SSC (w; -condition) is the most suitable for the osculating elements 
approach. In this case, we use an arbitrary normalized time-like vector w^Wa = ^ 
and impose the following conditions 



The vector field is parallel propagated along the world line of the test mass and 
these conditions imply = mu^ and m is conserved. This SSC is the most suited 
for the osculating elements approach. 

Alternatively one can linearize the equations with respect to the spin In this 
case the relation between the velocity and the 4-momentum takes the simple form 



and the supplementary conditions “T” and “F” coincide. The equations of motion 
are now given by: 



As is apparent from (26), this form of the equations of motion is suitable for the 
osculating orbits method, yielding a perturbing force of the form 



We will now stop considering a general background space time and focus on a 
particular choice: de Sitter. This is a spacetime with a constant curvature which is 
at the same time fully symmetric. This allows us to solve the equations of motion 
analytically and to gain better understanding of the trajectories and the role of the 



The motion of spinning test particles in de Sitter spacetime has previously been 
investigated by [57] where it was found that under the Tulczyjew SSC, the trajectory 
is a geodesic with the parallel transport of an appropriately defined spin vector. In 
addition, under the Frenkel-Pirani SSC, it was found that the trajectory is perturbed 
about a geodesic by an oscillatory motion but the final solution for the trajectory was 
left as a numerical integration. We focus on this oscillatory motion in more detail 
and relate it to motion under the u; -condition. 

The first Mathisson-Papapetrou equation (11) simplifies in de Sitter spacetime to 



= 0 , = 0 . 



(23) 




(24) 







(25) 




(26) 




(27) 



SSC. 
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where I is a real constant, related to the Ricci scalar via R = 12//^. At first glance, 
it might appear that the Frenkel-Pirani SSC will lead to the simplest trajectories, 
as is identically zero in this case. However, due to the difference between 

4-momentum and 4-velocity in (13), this generically leads to non-geodesic motion. 

We can write the equation of motion under both the Frenkel-Pirani and the 
u; -condition in the same functional form, given by 



DrU^ = ±- 



UOT] 



a[3iiv 



FfjUnSjy 



~V{FaS<^f + {F^Fn 52 ’ 

UaU^ = 1, 



DrF°‘ = 0, = 0, 



S„S^ = -s^ 



(29) 



where is a spin 4- vector constructed from the spin tensor such that the SSC is satis- 

fied, S and uo are real constants, and is the permutation symbol. Differentiating 
the equation for results in 

= -LO^ (m“ - F") , (30) 

demonstrating that can be viewed as a forcing term for the oscillations. 

The frequency of oscillation uj and the forcing term are different for the two 
SSCs: for the Frenkel-Pirani case, we find 



while under the w-condition, 

F^ = 

w 

LO = 

As we have an explicit equation for could now numerically integrate, 

using the method of osculating elements, to find the trajectory. Instead, it is possible 
to find a general analytic solution to (29) for the motion of spinning test particles 
in de Sitter spacetime. As a starting point, we note that the solution in Minkowski 
spacetime has been determined previously (see [56, 58, 59], for example). Under 
both the Tulczyjew and u; -conditions, the particle follows a geodesic whilst under 
the Frenkel-Pirani condition, the particle undergoes purely circular motion, boosted 
along a central geodesic. 

Since the de Sitter and Minkowski geometries are both maximally symmetric, it 
might be expected that a similar solution representing circular motion will be found 
in de Sitter spacetime. We are interested in the 16 components of the position 
velocity forcing term F^, and spin 4-vectors, using spherically symmetric 



77a F ^ a 

F = — , 
F 2 M 

^ “ ~Y' 



(31) 

(32) 






52 



2M/2’ 



(33) 

(34) 
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Static coordinates. Using the 10 isometries of the de Sitter spacetime and the four 
constraints in (29), it is possible to show that a completely general solution to the 
equations of motion is given by 



where r and are free constants. This solution explicitly corresponds to circular 
motion about the origin at a frequency that tends to u in the limit that / ^ oo, 
consistent with the Minkowski result. 

In spacetimes with fewer symmetries than de Sitter, we do not anticipate that such 
an exact analytic solution for the trajectory can be found, although progress can still 
be made. Different classes of pole-dipole orbits have been identified in the equatorial 
plane of Kerr [60] and it has been shown numerically that the motion of spinning 
test particles in Schwarzschild is of a helical nature [61]. The existence of the exact 
de Sitter solution can be used to further our understanding of spinning test particle 
trajectories in these more physical spacetimes. 

In addition, the similarity of the solutions in de Sitter under the u; -condition and 
the Frenkel-Pirani SSC will hopefully lead to a better understanding of these SSCs. 
Particularly, we note here that the product of covariant frequencies, cjp o;w = 1/^^ 
is dependent only on the curvature of de Sitter and not on the multipole moments of 
the test particle. If a similar fundamental link between the two SSCs exists in other 
spacetimes, it might allow us to infer properties of the Frenkel-Pirani trajectory by 
numerically integrating the simpler equation of motion under the u; -condition. 



x^{t) = = uW, r, 0 = —, (j) = , 



(35) 




(36) 



(37) 




4 Detecting GW Signals from EMRIs 



In the previous two sections we have described the formation of EMRI systems and 
how the gravitational waves they generate can be modelled. Both those problems 
are very hard and not yet solved in full, and those astrophysical and theoretical 
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uncertainties in EMRI rates and in models of the GW signal are coupled to the 
data analysis challenges. Before we describe specific data analysis algorithms for 
extracting EMRI GW signals from the detector data we will give a general description 
of the signal and the problems we face in data analysis. 

As mentioned earlier, an EMRI generates 10^-10^ gravitational waveform cycles 
in the eLIS A band. We therefore need to model it very accurately if we want to avoid 
systematic biases in the inferred parameter estimates. The expected signal-to-noise 
ratio (SNR) from those systems is not very high (probably less than 50), but during 
the Mock LISA Data Challenges (MLDCs) successful extraction of EMRI signals 
with SNRs as low as 19 was demonstrated, using the same approximate EMRI model 
(the analytic kludge described earlier) for both injection and recovery. As described 
in Sect. 1 , the EMRI signal depends on 14 parameters, if the spin of the CO is ignored, 
which is justifiable for mass ratios less than 10“"^. It is convenient to describe the 
EMRFs dynamics in the frame fixed relative to the spin axis of the MBH. The spin 
direction is usually taken to be the z-axis, but we have full freedom in choosing the 
orientation of the x, y-axes, and this choice is degenerate with the initial azimuthal 
position of the CO. Since the signal is long lived (stays in band for the entire duration 
of observation) there is a significant modulation of the amplitude and the phase of the 
waveform caused by the orbital motion of the detector. This allows us to measure the 
source sky position with a precision of a few degrees for signals with SNR 20 [4] . 

EMRIs are primarily GW sources for a future space-based detector like eLISA 
[28], and the data analysis discussion presented in this section is based on analysing 
data from such an instrument. Here we will always assume that the instrumental 
noise is Gaussian (but not white) and that EMRIs are the only GW sources in the 
data. These are not realistic assumptions for eLISA like data, but make the problem 
more tractable and the resulting algorithms are still likely to be effective when the 
assumptions are relaxed. Eor the purpose of developing data analysis algorithms and 
EMRI detection strategies we use somewhat simplified models of GW signal (in 
particular the analytic “kludge” model described in the Sect. 3.1), which capture the 
main physical features present in the expected signal (periapsis and orbital precession, 
slow inspiral, Doppler modulation, multiple harmonics) and are also fast to generate 
numerically and so can be used for computationally expensive parameter estimation. 
The need to quickly evaluate hundreds of thousands of waveforms to perform data 
analysis is the main factor which prevents us from using more realistic models. If the 
data analysis algorithms do not use any model specific features, they can be easily 
ported to use the best GW signal model available at the time the data is analysed. 

There are two data analysis challenges associated with the search for EMRI sig- 
nals. The first one is to find a signal in the noise, in other words to test the null 
hypothesis that the observed data is consistent with noise only. This could be a prob- 
lem for signals with SNR below 20, however we do expect to see a few dozen signals 
from EMRIs with SNR above 20, which should be detected with high statistical 
significance. Therefore we will concentrate on such reliably detectable signals. The 
situation will become more complex when other GW signals are present in the data 
(especially the foreground from Galactic white dwarf binaries) and/or with realistic 
instrumental noise. We do expect some environmental and instrumental artefacts to 
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be present in the data and the LISA Pathfinder [62] measurements (scheduled for 
launch in July 2015) will allow us to simulate a more realistic eLISA data stream in 
the near future. 

The second problem is what we will focus on in the rest of the current section. 
The large dimensionality of the parameter space of possible signals makes a grid- 
type search completely infeasible, so instead we will rely on (pseudo)-stochastic 
search methods, primarily based on Markov chain Monte-Carlo (MCMC) techniques. 
Various implementations of MCMC for searches for EMRIs signals are described in 
[4, 63, 64], but the basic idea is to construct a chain which moves predominantly in 
the direction of increasing likelihood. The complication is that the EMRI likelihood 
hyper-surface has numerous local maxima some of which could be as much as 70- 
80 % of the global maximum and these local maxima are widely separated in the 
parameter space. The problem is similar to finding the tallest tree in a forest. A 
standard MCMC based search will reach a local maximum and get stuck there for 
a significant number of steps. Theoretically MCMC has a non-zero probability of 
exploring the whole parameter space and finding the global maximum, but in practice 
it can get stuck on a strong local maximum for a very long time. Since we consider 
here only clearly detectable signals, when we refer to a detection we will mean 
successfully finding the global maximum of the likelihood (which is near the true 
parameters of a simulated signal and by “near” we mean comparable to the expected 
statistical deviations due to the presence of detector noise). 

In order to detect a GW signal from an EMRI we need an algorithm which can 
explore efficiently a large part of the parameter space and at the same time concentrate 
more on regions of high likelihood. Parallel tempering MCMC is one such algorithm 
and it was used in the MLDCs by N. Cornish [65]. Here we describe two other 
methods which share the same core principle, based on understanding and exploiting 
the reason for the presence of local maxima. To understand this reason, we need to 
look carefully at the GW signal. The GW signal from an EMRI is a superposition 
of harmonics of three fundamental frequencies, which slowly evolve as the CO 
inspirals. 



Hi) = ^ X (39) 

l,m,n \l,m,n J 

These fundamental frequencies (instantaneously or for a geodesic motion) are 
associated with three degrees of freedom: the radial frequency is associated with 
eccentric motion from periapsis to apoapsis and back; the polar frequency (6>— motion) 
is associated with spin-orbital coupling and the resulting precession of the orbital 
plane around the spin axis of the MBH; and finally the frequency of azimuthal 
motion [45, 66]. The frequencies evolve under radiation reaction (self-force) on 
a time scale associated with the mass ratio, which is for EMRIs significantly 
longer than the orbital time scale. As the CO spirals toward the MBH the over- 
all amplitude of the signal is slightly increasing but the amplitude of individ- 
ual harmonic depends on the instantaneous orbital parameters like eccentricity 
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and inclination. Due to orbital circularisation under radiation reaction [39] the 
amplitude of some harmonics (high /) will decrease while that of some other 
(low /) harmonics will increase, but in all cases the amplitude of each harmonic 
is a smooth and slowly varying function of time. We can construct a periodogram of 
the EMRI signal, and it looks like a comb in the time-frequency plane, see Fig. 2 as 
an example. 

The global maximum corresponds to the case when two combs representing a 
signal and a search template coincide exactly in amplitude everywhere in the time- 
frequency plane. The reason for the local maxima is a partial overlap between the 
signal harmonics and the harmonics of a template. These might not be the same 
harmonics (the same set of /, m, and the strength of a given local maximum will 
depend on how long (in frequency and in time) the harmonics of the signal and 
template coincide. 

In the search for a GW signal we use matched filtering which is an optimal detec- 
tion technique in the presence of Gaussian noise and can be seen as an inner product 
of the data x(t) = n(t) s(t) with a template h(t). Here n(t) is the instrumental 
noise and the signal s(t) = s(t; A) depends on the parameters of the source (A), 
which we are trying to estimate. The inner product is defined as 

(x,h) = 2m / df, (40) 

Jo Snif) 

where tilde denotes a Fourier transformed quantity and Sn (/) is the one-sided power 
spectral density of the noise in the detector. If the signal is confined within a narrow 
frequency band around /o, so that we can treat Sn(fo) as almost constant, the inner 
product can also be written in the time domain in a simple form: 
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[ x(t)h(t)dt, (41) 

^nUO) Jo 

where T is the observation time (or duration of a template). The assumption that 
Sn{f) is approximately constant over the signal evolution is valid for signals of 
duration up to 2-5 months (dependent on the parameters). Since the amplitude of an 
EMRI signal is a slowly growing function of time, one can see from Eq. (41) that 
the SNR (SNR^ = (s,s)) roughly grows as the square root of the observation time. 
We can use a maximum likelihood estimator to determine the GW parameters. The 
likelihood ratio is given by 



Ad) = = (x,ft(A))-i(ft(A),ft(A» C 42 ') 

P(x|0) 

where P(x\h(X) is the probability that the data v would be observed when a signal 
corresponding to the specified set of parameters is present in the data and R(x|0) is 
the probability that the data would be observed when no signal was present. Usually 
the likelihood (or log-likelihood) can be maximised over some parameters of the 
signal analytically, whereas maximisation over other parameters requires a numerical 
search. The analytically maximised likelihood is quite often referred as the F-statistic 
[63, 64, 67]. 

Based on the Eqs.(40) and (41) we can introduce a cumulative likelihood (or 
cumulative F- statistic) in the time and/or in the frequency domain by varying the 
upper limit of integration. If the template matches the signal exactly we expect to 
have steady growth of the cumulative E- statistic as a function of time or frequency 
(in other words it should be a monotonic and not decreasing function). In the case of a 
local maxima we will observe “bursts” of increase in the F-statistic around instances 
of time (or frequency) where one or more harmonics of the template and signal 
match. This is illustrated in Eig. 3, the left panel shows schematically a harmonic of 
a template successively intersecting and overlapping with two different harmonics 
of the signal, one of which (in black) if stronger than the other. 

In the right panel of the same Eigure, we show the corresponding accumulation of 
the F- statistic, and the instances of two intersections are clearly seen here as a rapid 
increase in the F- statistic. This illustrates nicely the reason for the presence of strong 
local maxima in the parameter space which we hit while constructing the Markov 
chain: harmonics of a signal can reproduce (overlap) one or a few strong harmonics of 
a signal for a span of time sufficient to accumulate a significant value of the detection 
statistic. This makes a “curse” into a “blessing”: we can use the information of the 
locations of the local maxima to guide the search to find the global maximum of 
the likelihood. This is a key part of the search for EMRIs and the main basis for 
the two specific methods described in the following subsections. We find many local 
maxima by running multiple MCMC chains with different seeds, and then analyse 
the accumulation of the F -statistic to identify the parts (harmonics) of the signal that 
were found at each of those local maxima. Then we use this information to run a 
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Fig. 3 Cartoon showing two harmonics of a signal in green and black {black being stronger) and 
a harmonic of the template intersecting the signal at two instances {left plot). In the right plot we 
give the corresponding accumulation of the F-statistic in time. The two significant positive slopes 
(in pink) corresponds to two instances of overlaps between a signal and a template 



constrained MCMC (as described in Sect. 4.1) or place them on the time-frequency 
plane and fit them with the harmonic tracks of a template by varying the source 
parameters (as described in Sect. 4.2). 



4.1 Constrained Markov Chain Monte Carlo Search 

In this subsection we will summarize the method which was successfully used to 
analyse the Mock LISA data challenge [4] and described in greater detail in [63]. In 
this method we split the data into 6-month long subsets and start by analysing each 
of them separately, before joining them together once we have started to lock onto 
the signal. 

In the first step we perform a stochastic search: we randomly draw parameters 
from the prior range and evaluate their likelihoods. This is continued until multiple 
statistically significant points have been identified in the parameter space. Those 
points are then refined by running small MCMC chains seeded at those points. The 
local maxima are then analysed to find common harmonics (in time and frequency). 
These are identified as sections of harmonics of the true signal, although usually we 
do not know the associated harmonic indices. 

In the second step we run a constrained MCMC. The sections of harmonics found 
in the first stage serve as constraints. We do realise that those constraints might 
not be exact, so we first run the MCMC with the frequency constraints and adjust 
the other parameters then we release the constraint and allow the code to adjust the 
constrained frequencies before fixing these again and repeating. This works very well 
in practice, even if the frequency of some of the (especially weak) harmonics was 
not determined very accurately initially. We also run several chains simultaneously 
to check for convergence to a global maximum. 
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In the third step we join the 6-month-long subsets of data together and let the chains 
adjust to match together the best found solutions in each subset. This method was used 
to analyse simulated data with a single relatively strong (SNR between 50 and 130) 
EMRI signal [63]. The identification of a signal was remarkably good with an ultra- 
precise recovery of the system parameters. The technique was also used to analyse 
the third Mock LISA data challenge data set, for which there was a single data set 
with five weak (SNR about 20) EMRI signals. The technique successfully identified 
two signals, while for the other three signals we identified that they were present 
but did not determine reliable estimates of their parameters before the challenge 
deadline. 



4.2 Detection ofEMRIs Using a Phenomenological 
Template Family 



In this subsection we summarise the method described in [64] . The main idea of this 
approach was to detect GW signals from EMRIs in a model independent way using a 
minimal set of assumptions about the signal: (1) the orbital motion can be described 
by six slowly (on the radiation reaction time scale) changing quantities; and (2) the 
signal is represented by a set of harmonics of those (three) orbital frequencies with 
slowly changing amplitude. Those are rather mild constraints and should describe 
also “dirty” EMRIs where the orbital motion is perturbed either by the astrophysical 
environment or by a deviation in the spacetime geometry of the central BH [34, 68]. 

We can use the assumption of slow frequency and amplitude evolution to decom- 
pose the phase and amplitude of each harmonic as a Taylor series and perform the 
search over the coefficients of the Taylor expansion. We call this a phenomenologi- 
cal EMRI template — the relationship between the Taylor series coefficients and the 
physical parameters depends on the specific model for the GW signal from an EMRI 
system. By searching over phenomenological parameters (Taylor coefficients) we do 
not restrict ourselves to any specific model within the framework of our assumptions 
above. The truncation of the Taylor series and the number of harmonics included 
depends mainly on the SNR of the signal: for weak signals we have to use a higher 
order expansion in order to match the signal for a longer time. Detection of EMRI 
signals in this model independent way allows us to relax stringent requirements on 
the accuracy of the theoretical model and to test alternatives to the assumption of a 
CO inspiral occurring in a pure vacuum Kerr spacetime. 

Here we describe the simulations performed in [64] . Three month long data sets 
were simulated containing an EMRI signal (SNR = 50) using the numerical kludge 
as a model. Multiple MCMC searches using the phenomenological templates were 
carried out with different starting seeds. The results were collected and analysed for 
the presence of local maxima. Eor each identified maximum a patch of the signal 
harmonic which was found was extracted and placed on the time-frequency plane. 
The resulting map looks as presented in Eig. 4. In this example the injected source 
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was a strong signal and the method recovered 13 harmonics. In more realistic cases 
we would expect to recover 3-5 harmonics only. We note that the strong harmonics 
(at low frequency) are better recovered (through the full duration of the observation). 
Notice also that the last month of the data is recovered less well than the first two, 
which is due to the orbital motion of the detector — the antenna beam pattern during 
the last month is pointing away from the source. 

In the second stage it is necessary to assume a certain EMRI model, so that 
the found harmonics can be identified and the physical parameters of the system 
recovered. In particular it is here that we can assume several alternatives: a CO 
spiralling toward a Kerr MBH, a CO spiralling into a massive boson star, a “dirty” 
Kerr black hole (a bumpy BH or a complex astrophysical environment). Once the 
model is assumed, we can find the set of parameters which give the best fit to the 
found set of harmonics (in amplitude and in their evolution). One can use a simple 
chi-square test of goodness of fit to estimate how well the assumed model describes 
the observed harmonic tracks and hence make a statement about the model. Results 
for the recovery of orbital parameters if the same model is used for recovery and 
signal generation were presented in [64] . 



5 Conclusion 

In this article we have described one of the most interesting GW sources for the future 
space based gravitational wave observatory eLISA. We have briefly described the 
various channels for EMRI formation and expected event rates. Then we went through 
an inventory of available models for the GW signal generated by EMRIs. We also 
briefly discussed the osculating element approach for integration of the forced (under 
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radiation reaction) motion of a CO in Kerr spacetime, and its application to the case 
of a spinning CO. One non-trivial question is the influence of the spin supplementary 
condition on the computed motion of a spinning CO and we have addressed this by 
looking at a simplified case: the motion of a spinning test mass in de Sitter spacetime. 
This should provide guidance on how to proceed in the case of a Schwarzschild 
or Kerr spacetime. Finally we have described the challenges which we will face in 
extracting GW signals generated by EMRIs from eLIS A data. The main problem is to 
search for a global maximum of likelihood in the multidimensional parameter space, 
when multiple strong local maxima are also present. We have described how one can 
extract useful information about the signal from the locations of those local maxima 
in order to direct the search to the correct solution. In addition we have outlined the 
possibility that these methods can be used to verify that the central massive compact 
object is indeed described by the Kerr metric, as predicted by general relativity. 
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Level Sets of the Lapse Function 
in Static GR 



Carla Cederbaum 



Abstract We present a novel physical interpretation of the level sets of the (canoni- 
cal) lapse function in static isolated general relativistic spacetimes. Our interpretation 
uses a notion of constrained test particles. It leads to a definition of gravitational force 
on test particles and to a previously unknown uniqueness result for the lapse func- 
tion. In Sect. 5, we discuss photon spheres in static isolated relativistic spacetimes 
and relate them to the level sets of the lapse function. 



1 Static Isolated Relativistic Spacetimes 

Static isolated relativistic spacetimes have been studied from a number of perspec- 
tives including regularity, asymptotics, construction of explicit solutions etc. They 
serve as models of static isolated star and black hole configurations. The standard 
example is the Schwarzschild family of spacetimes 



modeling the vacuum exterior region of a black hole or spherically symmetric matter 
distribution. Here, M = mGc~^ is the mass parameter and denotes the standard 
metric of the round sphere. 

N is cdMcdthQ (canonical) lapse function. For M = m = 0, we find the Minkowski 
spacetime as a special case in the Schwarzschild family. 

More generally, a static relativistic spacetime is given by a smooth Lorentzian 
metric ds^ which has a smooth timelike Killing vector field X = dt which is 
hypersurface-orthogonal with respect to the induced Levi-Civita connection V : 
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Fig. 1 The time- slices of a 
canonically decomposed 
static spacetime 
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^[oV^X^]=0. (3) 

It follows that the (canonical) lapse function N := J—ds'^(X, X) is positive and 
time-independent. The metric locally splits into a warped product 

ds^ = -N^c^dt^ + g, (4) 

where is a time-independent Riemannian metric on a time-slice {t = const}. All 
time-slices are isometric and extrinsically flat, cf. Fig. 1. 

A static spacetime with metric of the form (4) is said to model an isolated 
system if 

9ij ^ij (5) 

N ^ I (6) 

as r ^ 00 and if the topology of the time-slices is that of Euclidean 3-space outside 
a ball. Thinking of the time- slices as only modeling the exterior region outside the 
(assumedly finitely extended) matter distribution and outside all black holes, let us 
additionally assume that ds^ satisfies the vacuum Einstein equations. Making use of 
the time-translation symmetry, the vacuum Einstein equations reduce to the static 
vacuum equations 



ARic = (7) 

AA = 0 (8) 

on any and every time-slice. Here, Ric denotes the Ricci curvature tensor, the 
covariant Hessian, and A the covariant Laplacian corresponding to the 3 -dimensional 
Riemannian metric g on the time-slices. 
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While the Riemannian metric g captures the geometry of the time-slices, the 
lapse function N incorporates the information of how we have to arrange or stack 
the (isometric) time-slices in order to reproduce the static spacetime, cf. Eq. (4) and 
Fig.l. 



2 Physical Interpretation of the Lapse Function 

What is the physical significance of the lapse function N1 Does it play a role similar to 

that of the Newtonian potential in static Newtonian gravity? Surprisingly, it (almost) 

does, and in a number of ways: 

• Newtonian limit: The pseudo-Newtonian potential U \= c^lnN converges to 
the Newtonian potential in the Newtonian limit (along any suitable family of static 
isolated spacetimes possessing a Newtonian limit in the sense of Ehlers’ frame 
theory), cf. [1, 2]. 

• Asymptotic behavior: The pseudo-Newtonian potential U = c^lnN asymptoti- 
cally behaves like 

mG ^ / 1 \ 

as r ^ 00 , where m is the ADM-mass of the system, cf. [3]. Moreover, the next 
order term in the asymptotic expansion of U corresponds to a center of mass term 
of the same form as in the Newtonian setting 

mG mGz • x / 1 \ 

u=-- — ( 10 ) 

cf. [2] . Both asymptotic decay statements are made precise using weighted Sobolev 
spaces. 

• Equipotential surfaces: Relativistic test particles constrained to a given 2-surface 

generically accelerate along unless E^ is a level set of the lapse function N 
and thus also of the pseudo-Newtonian potential U = In A. It is thus justified 
to call those level sets equipotential. We will describe this phenomenon in more 
detail in Sect. 3. 

• Uniqueness of lapse: N and thus also U = c^XnN are uniquely determined 
outside the matter distribution by the spatial metric g - and everywhere in a given 
time- slice by the metric g and the matter variables induced on any time- slice by the 
energy momentum tensor. We will sketch how this follows from the equipotential 
nature of the level sets in Sect. 4. More details as well as two alternative proofs 
can be found in [2, 4]. 

• Degrees of freedom: Introducing asymptotically fiat wave-harmonic coordinates 
(x^) on a time-slice, N (or, alternatively, U = In N) together with the coordi- 
nates uniquely determine the components gtj of the spatial metric of a given static 
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isolated spacetime. So, in this sense, static isolated spacetimes have four degrees 
of freedom, namely those corresponding to the choice of {N , , x^) . This 

coincides with static isolated Newtonian gravity, where the degrees of freedom 
are given by the Newtonian potential and three Galilei coordinates. 



3 Equipotential Surfaces and Gravitational Force 

In static Newtonian gravity, the (negative) gradient of the Newtonian potential defines 
the force F on a unit mass test body. This has a well-known consequence for the 
equipotential or level set surfaces: if a test body is constrained into one of the level 
set surfaces, the gravitational force F is perpendicular to the path of the test body and 
does thus not have a tangential component. Hence the test body does not tangentially 
accelerate along any level set surface, see Fig. 2. If, on the other hand, a test body 
is constrained to an arbitrary surface it will in general accelerate due to the 
tangential component of the force along 

Surprisingly, the “same” is true for the level sets of the lapse function in a static 
isolated spacetime. To see that, we need to replace the constrained Newtonian test 
bodies in the above picture by “constrained (relativistic) test particles”. In order to 
define these, recall that a timelike curve /i(r) is called a (freely falling relativistic) 
test particle if it is a critical point of the time functional T [p] given by 

T[p] := r |A(r)|Jr. (11) 

Jro 

In the context of static isolated spacetimes, the warped product structure of the metric 
captured by (4) allows us to think of a timelike curve //(r) as having temporal and 
spatial components such that 



/i(r) = (r(r),x(r)). (12) 

Accordingly, let us say that a timelike curve p(r) is constrained to a surface E^ C 
[t = const} if x(t) e E^ for all r. In other words, we ask the curve p(r) to stay 
within the warped 3-dimensional timelike cylinder (— oo, oo) x E^ “over” E^ in the 
spacetime. We then define a constrained (relativistic) test particle to be a timelike 
curve yu(r) which is a critical point of the time functional T [/i] as in (1 1) but subject 
to the constraint that it (and all its competitors in the variation of T [p]) is constrained 
to a given surface E^ C {^ = const}. 

Equipped with this notion of constrained test particles, let us say that a closed 
surface E^ sitting in a time-slice {t = const} of a static isolated spacetime is an 
equipotential surface if every test particle constrained to E^ is a geodesic in E^ 
with respect to the induced 2-metric, i.e. does not accelerate within E^, see again 
Fig. 2. This is a geometrized analog of the Newtonian notion of equipotential surfaces. 
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Fig. 2 A (Newtonian) test 
body constrained to a surface T? 




A straightforward analysis of the Euler-Lagrange equations of the functional T [/i] 
subject to the constraint that x{r) G for all r shows that is an equipotential 
surface in a given static isolated spacetime if and only if E^ is a level set of the lapse 
function N . Thus, in their effect on constrained test particles, the level sets of the 
lapse function N in static isolated spacetimes play precisely the same role as those 
of the Newtonian potential in static isolated Newtonian gravity. 



3.1 Gravitational Force 

In Newtonian gravity, the equipotential property of the level sets of the potential stems 
from the fact that the (negative) gradient of the potential determines the gravitational 
force per unit mass. It is thus tempting to define gravitational force per unit mass 
as the (negative) gradient of the lapse function N in our context. Our Newtonian 
limit analysis^ however suggests that we should replace the lapse function N by the 
pseudo-Newtonian potential U = c^lnN . Observe that U has the same level sets 
as N . 

Moreover, it is more adequate^ to take the gradient of U not with respect to 
the induced spatial metric g but instead with respect to the conformally equivalent 
Riemannian metric 7 := N'^ g. We suggest to call 7 the pseudo-Newtonian metric 
of the given static isolated spacetime. 

Finally, it is important to realize that a notion of gravitational force constructed 
by the above equipotential principle will only apply to test particles.^ It will not 



^cf. Sect. 2 for a sketch of and [2] for more details on the Newtonian limit analysis of static isolated 
spacetimes. 

^This follows from the Newtonian limit analysis combined with a more detailed study of the 
geometry of pseudo-Newtonian gravity, cf. [2] . 

^Our definition of gravitational force can be extended to general timelike curves that are not neces- 
sarily geodesics. These can be interpreted as test particles that are subject to not only gravitational 
but also to non-gravitational forces (e.g. electro-magnetic ones). 
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readily"^ apply to extended bodies. This distinction is irrelevant in static Newtonian 
gravity due to the superposition principle for the linear Poisson equation satisfied by 
the Newtonian potential. 

So let yu(r) = x(r)) be a test particle in a static isolated spacetime with 

metric ds^ as in (4). Assume that ji has mass m or that 



ds^ 




(13) 



for all r. We then define the gravitational force on the test particle /i(r) as 



F \= 



(14) 



where ^ V denotes the (3-dimensional) 7 -covariant gradient. Furthermore, we define 
its (3-dimensional) acceleration^ by 

a:=^Vdx^. (15) 

dr dr 



Here, ^ V denotes the (3 -dimensional) ^-covariant gradient. A straightforward com- 
putation shows that with these definitions of gravitational force and acceleration, the 
second pseudo-Nowtonmn law of motion 



F = md 



(16) 



holds for all freely falling test particles of mass m, see [ 2 ]. 



4 Uniqueness Results 

In a static isolated spacetime, the Einstein constraint equations on every time-slice 
reduces to 

R = 0. (17) 

We observe that the lapse function N does not appear in this constraint equation. 
Using Choquet-Bruhat’s local uniqueness theorem (see e.g. [5]) for the Einstein 



^The author has some first ideas how to generalize the notion of gravitational force presented here 
to extended bodies. A second Newtonian law of motion for extended bodies, however, seems more 
difficult at the moment. 

^The acceleration of the test particle is induced from the Lorentzian metric ds^ and thus naturally 
refers to the spatial metric g and not to the conformally transformed metric 7 because the definition 
of test particles relies on the “dynamics” of the spacetime. On the other hand, the definition of 
gravitational force uses the analogy of pseudo-Newtonian and Newtonian effects and thus should 
be formulated in pseudo-Newtonian terms. 
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equations, this implies that the spacetime corresponding to a given spatial metric 
g and lapse function N is in fact independent^ of the lapse function AL It follows 
that N is indeed unique: Suppose there was a second lapse function N such that 
^ 1 as r ^ 00 . Then by the above, the levels of N and N would be detected 
by constrained test particles. Those only depend on the Lorentzian metric ds^ which 
we have s^n to be independent of the lapse function whatsoever. Thus the level sets 
of N and N coincide so that TV = / o A^ for some real valued function /. The static 
vacuum equations (7) and (8) applied to TV and TV then imply 

0 = ATV = A(/ o N) 

= f" oN ||VAf ||2 + / o Af AAT 
= /"oAr||VAr||2 

SO that f" = 0 and thus TV = aN + (3 for some real numbers a, f3. Moreover, 

= VRic 
= {aN + (3) Ric 
= a + /3Ric 
= + f3 Ric. 

If g is not everywhere flat, this implies (3 = 0. Finally TV,TV^lasr^oo leads 
to ct = 1 so that TV = TV. 

We interpret this result as saying that “there is only one way of synchronizing time 
at different locations in a geometrostatic spacetime such that one sees staticity” just as, 
for a Riemannian geodesic, “there is only one way of walking along a geodesic such 
that one does not accelerate (up to affine re-parametrizations)”. The affine freedom of 
the parameter along the geodesic does not make an appearance in the static isolated 
spacetime picture because we fixed the lapse function to asymptotically converge to 
1 at spacelike infinity and therewith fixed the time unit. 



5 Photon Spheres 



It is well-known that the Scharzschild spacetime (1) and (2) with positive mass 
parameter M = mGc~^ possesses a so-called photon sphere at r = 3M. This is 
to say that photons (aka null geodesics) initially tangent to the timelike cylinder 
(— 00 , oo) X {r = 3M) remain tangent to it or that “photons get caught in the sphere 
[r = 3 TIT}”. Moreover, each photon’s energy and frequency is constant in time (as 



^However, we do assume that the lapse function exists in the first place. 
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observed by the static observers N~^dt). This is a very interesting phenomenon. It 
has proved relevant for understanding questions related to dynamical stability and to 
gravitational leasing.^ 

As we have seen above, the positive mass Schwarzschild spacetimes are prime 
examples of static isolated spacetimes. It is thus natural to ask whether more gen- 
eral static isolated spacetimes can also possess photon spheres or whether the phe- 
nomenon is restricted to the spherically symmetric Schwarzschild case. To the best 
knowledge of the author, this question was first raised in [6]. 

To study this question, let us define^ a photon surface in a static isolated spacetime 
with metric ds^ of the form (4) as a surface C {^ = const} such that every photon 
(i.e. null geodesic) initially tangent to the cylinder (— oo, oo) x remains tangent to 

it. A photon surface possessing the property that the tangential photons have constant 
energy and frequency along the photon surface (in the eyes of the static observers) 
will be called a photon sphere. It turns out that this property is equivalent to constancy 
of the lapse function N along the photon surface, see Lemma 2.7 in [8]. 

An analysis^ of the null geodesic equation combined with the GauB-Codazzi- 
Mainardi equations and the static vacuum equations (7) and (8) shows that a photon 
sphere E^ in a static isolated spacetime must necessarily have constant mean curva- 
ture (or expansion). Furthermore, a photon sphere E^ has constant (intrinsic) GauB 
curvature and the normal derivative of the lapse function o{N) must also be constant 
along E^. 

In [8] , the author shows that a static vacuum isolated spacetime with metric as in (4) 
possessing a — connected — photon sphere E^ must be isometric to a Schwarzschild 
spacetime outside the photon sphere under a mild technical condition. This gives a 
partial answer to the question raised above; for a priori disconnected photon spheres, 
the issue will be addressed in [9], together with other results on photon surfaces. 

This theorem can be interpreted as a photon sphere uniqueness theorem; its proof 
in fact mimics Israel’s proof of static black hole uniqueness [10]. It allows to identify 
a given static vacuum isolated spacetime as in fact being a Schwarzschild spacetime 
whenever it is known to possess a photon sphere. 

More precisely, if E^ C [t = const) is a photon sphere then we can^^ use the 
lapse function A as a coordinate in the spatial slice [t = const). As Israel showed 
in [10], the static vacuum equations (7) and (8) can be rephrased as inequalities on 
each level set of the coordinate N . Integrating those inequalities from the photon 
sphere all the way to spatial infinity, we obtain inequalities relating the mean and 
GauB curvatures of the photon sphere to the ADM-mass m of the spacetime. Recall 
that we have already derived above that both of these curvatures are constant. 



^See [4] for more information. 

^See [6-8] for the origins of this definition. 

^cf. [8] for an exposition of this analysis. 

^^Making the same mild technical assumption as Israel [10] that the lapse function foliates the 
region exterior to the photon sphere. 



Level Sets of the Lapse Function in Static GR 



821 



When combining the static vacuum equation (8) with the asymptotic decay of the 
spatial metric g discussed in (9) we can relate^ ^ the ADM-mass m of the spacetime 
to the normal derivative v{N) of the lapse function via 

f 

— — / u(N)da = m. (18) 

47tG J^2 

This physical insight proves that the integrated inequalities are in fact equalities. This, 
however, implies that Israel’s inequalities must be identities on each level set of N. 
From this, it is straightforward to compute that the spacetime is in fact Schwarzschild 
with N playing the role of a rescaled radial variable. For more details, we refer the 
reader to [8]. 
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of Black Holes 
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Abstract In this paper, we discuss how the shadow of a Kerr black hole depends 
on the motion of the observer. In particular, we derive an analytical formula for the 
boundary curve of the shadow for an observer moving with given four- velocity at 
given Boyer-Lindquist coordinates. We visualize the shadow for various values of 
parameters. 



1 Introduction: What Is the ^^Shadow”? 



For an observer at Boyer-Lindquist coordinates (ro, the shadow of a black 
hole is defined as that region of the observer’s celestial sphere which is left dark if 
all light sources are distributed on a sphere with radius ri > ro- To calculate the 
shadow, we consider light rays, i.e. lightlike geodesics, which are sent into the past 
from the observer’s position. Then, the boundary curve of the shadow corresponds 
to the limiting case between geodesics going towards the horizon darkness) and 
geodesics going to the sphere at ri with light sources brightness). The limiting 
case are geodesics that asymptotically spiral towards (unstable) lightlike geodesics 
which fill a specific spatial region, the photon region /C, and are propagating on a 
sphere. Consequently, the shadow of the black hole is a mapping of this photon region 
1C and not of the horizon. 

Starting with the theoretical work of Bardeen who for the first time calculated the 
shadow of a Kerr black hole correctly [1] we now reach the time where it seems to 
be possible to observe the shadow of the black hole near Sgr A* at the center of our 
Galaxy. The expected image is shown by calculations of how the shadow of a black 
hole would look like if matter, in the form of an accretion disc, a corona, or a jet, is 
included in the model. These calculations are based on ray tracing and GRMHD, see 
e.g. [2-7]. It is hoped that the shadow of Sgr A* will indeed be observed in the near 
future within the Event Horizon Telescope project, see [8], or the Black Hole Cam 
project. 
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In the following, we discuss how differently moving observers at a given position 
see the shadow of a Kerr black hole. A detailed discussion of a more general space- 
time describing a Kerr-Newman-NUT black hole with cosmological constant can 
be found in [9] . There, we demonstrate how the shadow is influenced by a charge, 
the cosmological constant or the NUT parameter. Since there one can also find 
a discussion of metric properties and visualizations of the photon regions of the 
black holes, we restrict ourselves in this proceedings volume to the Kerr case. Our 
construction of the shadow is a geometrical one, based on the geodesic equation and 
ignoring the influence of matter. 



2 The Kerr Metric 

The Kerr space-time is a stationary, axially symmetric type D solution of the Einstein 
vacuum equations that describes a rotating black hole with mass m and spin a. In 
Boyer-Lindquist coordinates (r, ip, t) the Kerr metric can be written as [10, p. 314] 



dv^ = ^ ( A i sin^ ^ ~ 

-f ^(Ax — a(E -h ax) sin ^ dt dp — sin^ '^9) dt^ (1) 

where we use the abbreviations 

E = -h (<2 cos , X = ^ sin^ d, A = — 2mr -\- , (2) 

and rescaled units; hence, the speed of light and the gravitational constant are nor- 
malized (c = 1, G = 1). The coordinates t and r range over ]— oo, oo[, while ^ 
and p are standard angular coordinates on the two- sphere. Whereas the parameter m 
for the mass of the black hole could take all values in M+ the absolute value of the 
spin parameter a is bounded by m since the event horizon of the Kerr black hole is 
at r + \/nfi — a^. 



3 Calculating the Shadows of Black Holes 

As the geodesic equation in the Kerr space-time has four constants of motion — the 
Lagrangian C, the energy and the z -component of angular momentum 

dC . . dC , . 

^ ~ 9ttt ^ 4 ” 9^t^ ■> ( 3 ) 
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plus the Carter constant K [11] — the lightlike geodesics (£ = 0) are given by four 
separated equations of motion 



E/ 

Tiip 



X(L^ -Ex) (S + QX)((S + ax)E - aL^) 
sin^ A 

Lz - g((^ + ax)E - aLz) 



sin^ t? 



K - 



(XE - 



sin^ "D 
((S +ax)£ -aLj) 



A 

=: ©W 
2 



A/: =: R{r). 



(4a) 

(4b) 

(4c) 

(4d) 



The existence of the photon region 1C, i.e. the region filled with lightlike geodesics 
staying on a sphere r = constant, is crucial for calculating the shadow because it 
will give us a parametrization of the shadow’s boundary curve. By (4d), the sphere 
conditions r = 0 and r = 0 imply that R(r) = 0 and R'(r) = 0. Hence, 

„ {CE + ax) - aLsf 2r{(E + ax) - oLe) 

E-E = T . Ee = , (5) 

A r — m 



where Lr = 



L 

and Kr = Solving for these constants gives the expressions 



Kr = 



4r^A 
(r — m)^ ’ 



uLe = (S +ax) - 



2rA 



m 



( 6 ) 



for spherical light rays. Since 0 < wie find by (4c) an inequality characterizing 
the photon region 



/C: (2r A — E(r — m))^ < 4(2^r^A sin^ (7) 

Through each point (r, of JC there is a light ray propagating on a sphere. Plots and 
a detailed discussion of the photon region for different space-times can be found in 
[9, Figs. 3-5]. 



4 Viewing the Shadows of Black Holes 

For determining the shadow of a black hole we consider an observer at Boyer- 
Lindquist coordinates (ro, '^o) and assume, for the sake of simplicity, that the light 
sources are distributed on a sphere with radius rR > ro- 

Lightlike geodesics reaching the observer can be divided into two types of orbits. 
There are geodesics which passed the sphere with light sources and there are those 
coming from the horizon. Thus, our observer would see brightness in the direction 
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of light rays of the hrst type and darkness for the other ones. The boundary curve 
of the shadow is therefore given by lightlike geodesics that spiraled from one of the 
unstable spherical light orbits of the photon region 1C. 

The shape of the shadow depends on the observer’s state of motion. At Boyer- 
Lindquist coordinates (ro, we choose an orthonormal tetrad adapted to the 
symmetry of the space-time [10, p. 307] 



eo 



(E + ax)dt + adip 



(ro,^o) 



ei 




(ro,^o) 



dip + xdt 
Ve sin'?^ 



{ro,'^o) 




(ro,^o) 



( 8 ) 



It is chosen such that eo ± are tangential to the principal null congruences of our 

metric. Here, is interpreted as the four- velocity of an observer at (tq , because 
it is a timelike vector; ^3 points into the direction towards the center of the black 
hole. An observer with this tetrad is called a standard observer in the following. 

If another observer at {ro, "do) moves with velocity v = {v\, V2, V3), v = \v\ < 
1 = c, with respect to our standard observer, we have to modify the tetrad. The 
four- velocity of the moving observer is 



viei -h V2e2 + V3e3 + ^0 , 

eo = , . (9a) 

Vl — 

From 'eo,e\,e 2 ,e 3 we hnd an orthonormal tetrad 'eo,'e\,'e 2 ,'c 3 with the Gram- 
Schmidt procedure by adding ^3, ^1, e 2 ~va this order-successively to ?o 



(1 - vfjei -h vi(v 2 e 2 + eo) 



e2 + V2eo 




e3 



{l - vj - vf)e3 -h V3(viei + ^ 2^2 + eo) 

-yj\ — V'f — V2 S/1 — V'^ 



(9b) 



Note that 2/ = ei if Vf = 0, i.e., for v = 0 this procedure recovers the tetrad 
^0, ^2, ^3- As before, the spacelike vector 23 corresponds to the direction towards 

the black hole. The interpretation of i'l and ?2 becomes clear if we introduce celestial 
coordinates, see (11) and Fig. 1. Then, ^ei and 22 point into the north-south respec- 
tively the west-east direction. 
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Fig. 1 The direction of each light ray reaching the observer is given by the celestial coordinates 0 
and 'ip [Eq. (1 1)] of their tangents, see figure on the left. These points (6, ip) on the celestial sphere 
{black ball) can be identified with points in the plane {red ball) by stereographic projection, see 
figure on the right. The dashed {red) circles mark the celestial equator 6 = t:! 2 respectively its 
projection 



We can now describe the tangent vector of a light ray A (5’) by Boyer-Lindquist 
coordinates 



\ = rdy -\- + (fdip idt ( 10 ) 

and by celestial coordinates 0 and for our moving observer 

A = cr ( — ?o + sin cos 'ipe\-\- sin 0 sin %pe 2 -\- cos 0 es) (11) 

where 0 = 0 corresponds to the direction towards the black hole. For the tetrad (9) 
we observe the following dependencies regarding (8): 

^0 ~ kofdr -\- ko(pO(^ “h kofdt, 

ei = ki^d^ + k\^d^ + htdt, ( 12 ) 

^2 ~ k2(pd{p ~\~ k2tdf, 

€2 — “h “h k^ipd^p “h 



A = cr((— kor + k^r COS 0)dr + {—k{)^ + k\^ sin 0 cos ip + k^^ cos 0)d^ 

-\-{—kQip + k\^ sin 0 cos + k 2 ^p sin 6> sin + k^^p cos 0)dip (13) 
+(—kot + kit sin 0 cos + k 2 t sin sin + k^t cos 0)dt ) . 



Hence 
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Comparing coefficients of dr, and di^ in (10) and (13) yields 

f = cr(— ^Or + ^3r COS 0), (14) 

^ sin 0 cos + ^ 3 ^ cos 0), (15) 

ip = (r{—k{)ip + k\^ sin 0 cos p + k 2 ip sin O^inp -\- k^ip cos 0). (16) 



These equations can be solved easily for cos 0 and sin p (using sin 0 cos p = 
+ cos 9), 



cosO = 



If + kor 
k3r 



sin V’ = 



ks, 



+ koy: 



Ilf (f ^ + ^0^)) - (^3VP - i^f + kor) 

kipkl^ - {^f -\-korf 



(17a) 

(17b) 



where p, 'd and r have to be substituted from the equations of motion (4b), (4c) and 
(4d); since r and ^ are given as quadratic expressions, the signs have to be chosen 
consistently. 

The remaining scalar factor a can be calculated analogously to a in [9, Eq. (20)]. 
At first, express (9a) in terms of the tetrad {dr, d^,d^,dt) 



eo 









(E + ax)dt + ad(p ^ d^p xdt 
— + v\d^ - V2- ^ 



^/A 



sin'd 



— V2\/~Kdj 



) 



As cr = g[X, eo), see (11) we get a from (1), (10), and (18), 



1 

v^Vi - 



/aL^ - (E +ax)E 

\ x/A 



+ i’lE'i) — V2 



Ez XE 
sin'i^ 



(19) 



where p, ^ and r have to be substituted from (4b), (4c) and (4d) as above. 

With this expression, (17) indeed describes the boundary curve of the black hole’s 
shadow for a moving observer. The boundary represents lightlike geodesics which, 
if you think of sending them from the observer’s position into the past, reach the 
photon region asymptotically. Each such geodesic must have constants of motion 



4r^A 



(r — m)^ 



2r A 



r=rp 



Ke = 



aLE = (E + ax) 



r — m 



( 20 ) 
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given by ( 6 ) where Vp is the radius coordinate of the limiting spherical lightlike 
geodesic. For a > 0, this radius coordinate Vp is — as in [9] — extremal where the 
boundary of the exterior photon region intersects the cone d = Hence, the 
extremal values are the values of r where (7) holds with equality. Substituting Ke 
and Le in (17) and (19) by the expressions (20) provides the shadow’s boundary 
curve {0(Tp), V^(r^)) where Vp runs between the extremal values. 

If (2 = 0, then it is not possible to parametrize the boundary curve by rp because 
the right hand side of (7) is zero, so it determines a unique Vp. By (20) this results in 
an unique value for Ke which, when inserted into (17), gives the shadow’s boundary 
curve in the form ('^(L^), 6 >(L^)). Here, Le ranges between the extremal values 
determined by (4c) for = 0. 



5 Plots of Black Hole’s Shadows 



As described before, we used our analytical parameter representation (17) with (19) 
and ( 20 ) to calculate the boundary curve of the shadow as seen by an observer 
moving with four- velocity e^. The results in Fig. 3 are visualized via stereographic 
projection from the celestial sphere onto a plane, as illustrated in Fig. 1. Standard 
Cartesian coordinates in this plane are given by 

x{rp) = - 2 tan (^^) sin (^/’(rp)), 

yirp) = -2 tan cos [i>{rp)). ( 21 ) 



All plots of shadows shown in Fig. 3 belong to Kerr black holes where each subfigure 
combines the pictures for four spin values, see legend in Fig. 2. 

In principle, the shadows for moving observers (v 7 ^ 0) are calculable from the 
shadow seen by the standard observer {v = 0) with the help of Penrose’s aberration 
formula [ 12 ] 




c — V 
C V 




( 22 ) 



But for applying this formula, one may need to make coordinate transformations 
since the angles a and a have to be measured against the direction of the motion. 
Hence, no transformations are needed if the observer moves in radial direction. 



Fig. 2 Legend for the 
different spins a used for 
calculating the black hole’s 

shadows shown in Fig. 3 ^ 9 „ 4 . 

a = 0 a= a = a = a^ax 
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V — ( 0 , 0 , V — ( 0 , 0 , V — ( 0 , 0 , V ~ ( 0 , 0 , ) 



Fig. 3 Aberrational effects on the shadows of Kerr black holes. The subfigures show the stere- 
ographic projection of the shadow for observers moving with various velocities v = (vi, V 2 , V 3 ) 
which are noted beneath the plots (fq = 5m, i9q = f )• If v 7 ^ 0, the projected direction of the 
observer’s motion is marked by a (green) star. Each plot combines the silhouettes for four different 
spins (a = n - ^max where ^max = see Fig. 2 for the corresponding legend 



Then, the shadow is magnified if the observer moves away from the black hole, and 
demagnified if the observer moves towards the black hole. In this case, our formula 
(17a) reduces to the following common variant of Penrose’s aberration formula (22) 



1 + i; cos 0 



cosO = 



(23) 
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Penrose emphasized in his article [12] that the aberration formula maps circles 
on the celestial sphere onto circles. Thus, the shadow of a non-rotating black hole 
(a = 0) is always circular, independent of the observer’s motion. Consequently, our 
pictures of the shadow are then always circular, because the stereographic projection 
( 21 ) maps circles onto circles, too. 

Figure 3 shows several pictures of shadows for differently moving observers in 
Kerr space-times. The results for the standard observer (v = = e^) are shown 

in the left plot in the first row of Fig. 3. The right plot is exemplary and belongs to an 
observer moving with velocity v = j^).In each of the lower rows we 

vary only one component Vi of v; in the following we write v/ as abbreviation for 
those observer velocities v with vi ^ 0 and vj = 0,7 / / . Due to our definition of 
the tetrad in ( 8 ) and of the observer’s four- velocity ?o in (9a) the observer moves 
in 'd direction if v = and in r i.e. radial direction if i; = V 3 . For v = V 2 the motion 
is in (p direction. 

Since the last row of Fig. 3 shows the plots of shadows seen by a radially moving 
observer (v = V3), the shadows are magnified if the observer moves away from the 
black hole (^3 negative), and demagnified, if the observer moves towards the black 
hole (v 3 positive), as mentioned before. 

For velocities v = v\ or v = V 2 the shadow is shifted in the direction of the 
observer’s motion with bigger effects for higher velocities. Also the size of the 
shadow is affected. But all these aberrational changes are explainable if one relates 
the direction of the observer’s motion to the spin of the black hole and to the equatorial 
plane as symmetry plane. 

Furthermore, the shadow is symmetric with respect to a horizontal axis as long 
as the observer does not move in i3 direction because sin 'ip, see (17b), depends on 
'd which is given by a quadratic expression, see (4c). Hence, the different signs of 
^ yield different solutions of (17) for the points (0, ip) and (0,7t — pj). Without a ^ 
component in the velocity, the symmetry of the shadow is not affected even if the 
observer is not in the equatorial plane, i.e. do 

All in all, the shadows shown in Fig. 3 are calculated for relatively fast moving 
observers {v = 0.3 c up to i; = 0.9 c). Thus, the aberrational influence for the future 
observations of the shadow of Sgr A* within the Event Horizon Telescope or the 
Black Hole Cam project is expected to be very small since our solar system orbits 
the galactic center with roughly 250 ^ ~ Nevertheless, the study 

of aberrational effects are of interest from a fundamental point of view. 
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Time-Domain Inspiral Templates 
for Spinning Compact Binaries 
in Quasi- Circular Orbits 
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Abstract We present a prescription to compute the time-domain gravitational wave 
(GW) polarization states associated with spinning compact binaries inspiraling along 
quasi-circular orbits. We invoke the orbital angular momentum L rather than its 
Newtonian counterpart Ln to describe the orbits and the two spin vectors are freely 
specified in the source frame associated with the initial direction of the total angular 
momentum. We discuss the various implications of our approach. 



1 Introduction 



Gravitational waves (GWs) from coalescing compact binaries containing at least one 
spinning component are expected to be routinely detected by the second-generation 
laser interferometric detectors like advanced LIGO (aLIGO), Virgo and KAGRA [1]. 
The detection of GWs from such binaries and subsequent source characterization cru- 
cially depend on accurately modeling temporally evolving GW polarization states, 
/z+ (0 and /z X (0. from such binaries during their inspiral phase [2] . At present, /z+ (0 
and /zx(0 associated with non-spinning compact binaries inspiraling along quasi- 
circular orbits have GW phase evolution accurate to 3.5PN order and amplitude 
corrections that are 3PN accurate [3]. Recall that the 3.5PN and 3PN orders corre- 
spond to corrections that are accurate to relative orders (v/c^ and (v/c)^ beyond 
the ‘Newtonian’ estimates, where v and c are the orbital and light speeds, respec- 
tively. In the case of inspiraling compact binaries containing Kerr black hole (BHs), 
it is desirable to employ temporally evolving /z+(0 and hx(t) that incorporate the 
spin effects very accurately and the dominant spin effect arises due to the general 
relativistic spin-orbit coupling that appear at the relative 1.5PN order for maximally 
spinning Kerr BHs [4, 5]. Following Ref. [5], we define the spin of a compact object 
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as 5 = G X s/c, where mco, X ^ its mass, Kerr parameter and a unit vec- 
tor along 5, respectively and for a maximally spinning Kerr BH x = 1- We note that 
it is customary to employ the Newtonian orbital angular momentum Ln = //r x v, 
where //, r and v are the reduced mass, orbital separation and velocity, respectively, 
to specify these quasi-circular orbits [5, 6]. 

In what follows we provide a prescription to generate the time-domain amplitude 
corrected /z+(0 and hx(t) for spinning compact binaries inspiraling along quasi- 
circular orbits, described by their orbital angular momenta. Note that the amplitude 
corrected /z+(0 mdhx(t) refer to GW polarization states that are PN-accurate both 
in its amplitude and phase. We describe our approach to compute the fully 1.5PN 
accurate amplitude corrected expression for h x (0 while invoking L, the PN-accurate 
orbital angular momentum, to characterize the binary orbits. We symbolically 
obtain the additional 1.5PN order amplitude corrections to hx(t) in comparison 
with Eqs. (A3) in Ref. [6] that employ Ln to describe the binary orbits. We also dis- 
cuss certain implications of our approach while considering spin effects due to the 
leading order general relativistic spin-orbit coupling. Our attempt to perform GW 
phasing with the help of L is motivated by a number of observations (we term accu- 
rate modeling of temporally evolving GW polarization states as ‘GW phasing’). First 
being Ref. [7] that provided a prescription to implement GW phasing for spinning 
compact binaries in inspiralling eccentric orbits in an accurate and efficient way. We 
are further influenced by the fact that it is customary to use precessional equation 
appropriate for L to evolve Ln while incorporating the effects due to the dominant 
order spin-orbit coupling [5, 6, 8] . Finally, we note that a seminal paper that explored 
the inspiral dynamics of spinning compact binaries and the influences of precessional 
dynamics on x (0 employed L to describe their binary orbits [9] . Additionally, we 
specify the two spins in an inertial frame associated with the initial direction of the 
total angular momentum Jq while it is customary to invoke a non-inertial Ln - based 
orbital triad to specify the two spins in the literature. 



2 GW Phasing for Spinning Binaries Characterized by L 

We invoke, as noted earlier, the PN-accurate orbital angular momentum L to describe 
binary orbits and we specify at the initial epoch both the orbital and spin angular 
momentum vectors in an inertial frame associated with the initial direction of total 
angular momentum (see Fig. 1). We begin by presenting an expression for the cross 
polarization state having Newtonian (quadrupolar) order amplitude and the relevant 
expression reads [10] 



Iq(0 — 2 



G /i 
R' 



(1 — cos l) So sin l sin(a — 20) 



(1 -h cos i) So sin i sin(a -h 20) 
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z 




Fig. 1 The inertial frame and the Cartesian coordinate system where the z axis points along Jq, the 
direction of total angular momentum at the initial epoch. We display the angles that characterize the 
orbital and spin angular momentum vectors, denoted by A:, and S2, while the line of sight vector 
N is in the X — z plane. The dashed lines depict the projections of various vectors on the x — y 
plane 



1 9 

— -(1 + 2cos^ + cos^ i) Co sin(2ct + 20) 

1 9 

— -(1 — 2cos^ + cos^ l ) Co sin(2ct — 20) , 



( 1 ) 



where R', So and Co stand for the radial distance to the binary, sin6> and cosO, 
respectively. The dynamical angular variable O measures the orbital phase from 
the direction of ascending node in a plane perpendicular to k while its derivative is 
required to define jc^ = {Gm O/c^)^/^. To obtain temporally evolving hxlqit) 
associated with spinning compact binaries inspiraling along quasi-circular orbits, 
we pursue the following steps. First, we specify how the Eulerian angles a and 
i that specify the orientation of L vary under the conservative orbital dynamics. 
This requires us to employ three differential equations that describe the precessional 
dynamics of the orbital and spin angular momenta and these differential equations 
contain an orbital like frequency cj [5]. The conservative evolution for O and O are 
governed by the following differential equation 

6 = 0 ; — cos la , (2) 



where the orbital like frequency u is defined by the relation uj = vjr that connects 
uj to the orbital separation and velocity. This PN-accurate equation arises from the 
vectorial expression for the orbital velocity in a co-mo ving triad [5]. It is convenient 
to introduce a dimensionless parameter x = (G m cJorb/<^^)^^^ such that the above 
equation reads 6 = (G m /c^)) — cos l a. Thereafter, we impose the effects of 

gravitational radiation reaction on these differential equations by specifying how x, 
appearing in these differential equations, vary during the binary inspiral. 
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It turns out that we require to solve simultaneously the precessional equations 
for A:, Si and S 2 , the unit vectors along L, S\ and 5^2, to specify how a and i vary 
under the conservative dynamics as the precessional dynamics of L, and S 2 are 
intertwined. In Fig. 1, we display various angles that specify the orientations of these 
three unit vectors in the above inertial frame. The Cartesian components of these 
vectors are 



k = (sin L cos a , sin ^ sin a , cos t ) , 

51 = (sin6>i cos0i,sin6>i sin 0i, cos 6>i) , 

52 = (sin6>2 cos 02, sin 6^2 sin 02, cos 6^2) . 



(3a) 

(3b) 

(3c) 



The precessional 
Refs. [4, 5], read 

k 

Si 

h 



equations for the above three unit vectors, 



Gm 



Si 

Si q XI (^1 X A:) + — X 2 (^2 x k) 

q 



= (k X si) , 

Gm 



Gm 



S 2 (k X S 2 ) , 



extractable from 

(4a) 

(4b) 

(4c) 



where q — mi j m 2 and the symmetric mass ratio 7 ^ = /i/m is required to define the 
quantities Si and S 2 (<^i,2 = ?//2 + 3(l =p VI “ 4 t/)/ 4). We are now in a position 
to incorporate the effect of gravitational radiation damping and this is achieved by 
specifying the secular variations in v in the above differential equations. The PN- 
accurate differential equation for v arises from the usual energy balance arguments 
and may be obtained from Eq. (3.21) in Ref. [6]. 

We obtain numerically the time-domain GW polarization states for our binaries 
by simultaneously solving the differential equations for the Cartesian components of 
k,s\ and S 2 along with the PN-accurate equations for 6 and dx jdt. This implies that 
we numerically solve a set of 1 1 differential equations and the resulting variations in 
a, and O are implemented in Eq. (1) to obtain the temporally varying /zx Iq(0- 
A close inspection reveals that we require to specify four angles that provide the 
orientations of the two spins from Jq at the initial epoch to obtain /z x |q(0- It should 
be noted that the two angles that specify the initial orientation of k from Jq are not 
independent variables. This is because one may point the total angular momentum 
along the z-axis at the initial epoch without the loss of any generality. This allowed 
us to equate the v and y components of/ = L-h5i-h52to zero at that epoch and 
therefore to obtain the initial estimates for l and a. In our numerical integrations, the 
bounding values for v are given by vq = 2.9 x 10“^ (m' and Xf = 1/6, where 

m' is the total mass of the binary in solar units and let cc;o = 10 tt Hz as customary 
for aLIGO along with O(vo) = 0. Finally, we note that the values of a and l at 
every step of our numerical runs are obtained from the Cartesian components of k by 
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the relations: a = cos ^(kx/^k^ + k^) and l = cos ^(kz). Therefore, we require 
eight independent parameters to specify the inspiral dynamics of compact binaries 
with spinning components and these eight parameters are the four basic (constant) 
parameters namely (mi, m 2 , Xi, X 2 ) along with the above four angular dynamical 
variables that specify the two spin vectors in the inertial frame, namely (0i , 0i) and 
(O 2 , 02) as displayed in Fig. 1. 

In what follows we compare our approach with what is detailed in Ref. [6] that 
provided a way to obtain the time-domain ready-to-use GW polarization states for 
inspiraling compact binaries while incorporating all 1.5PN order spin effects both 
in amplitude and GW phase evolutions. Their approach differs from ours in two 
aspects. First, Ref. [6] employed / = Ln/|^nI to characterize the orbital plane and 
following the earlier papers employed a precessional equation for / that is identical 
to our Eq. (4a) for k while replacing A: by /. It is not very difficult to show that this 
is equivalent of using an orbital averaged expression for / [5]. However, invoking an 
orbital averaged precessional equation for / leads to an undesirable feature that the 
coefficient of / in the expression for n in the (n, A = / x w, /) frame will not, in 
general, vanish. It is fairly straightforward to compute the time derivative of n and 
express it in the (fi. A, /) frame as 



dn 

dt 



/d^' 

I h cos L 

\ dt 




sm O — sm ^ cos O 

dt 






( 5 ) 



where d, a' and O' are the usual 3 Eulerian angles required to define n, A and / in 
the 7 0 -based inertial frame. The fact that I = n x h/\n x h\ clearly demands that 
the coefficient of / in the above equation should be zero as also noted in Ref. [6] . 
However, if one employs Eq. (4a) to describe the precessional dynamics of / then it 
is possible to show with some straightforward algebra that 

, dd . . da' 3 f r ^2 

sin O — cos O sin l —— = v \0\qx\ (-^i • A) H X2 (-^2 * A) , 

dt dt G m [ q 

( 6 ) 

as noted in Ref. [7] . It is not very difficult to conclude that the right hand side of 
above expression, in general, is not zero. 

It turns out that h having components along / leads to certain anomalous terms that 
contribute to the O' evolution at the 3PN order and this is within the consideration 
of higher order spin effects available in the literature. Note that the higher order spin 
effects are known to 3.5PN order while dealing with the spin-orbit interactions [11]. 
We demonstrate our observation by noting that the definitions v = rh and v = r uj 
imply that = h • h. With the help of our Eq. (5) the expression for lo reads 
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where stands for I dt and the higher order terms that are cubic in the time 

derivatives of d and a' are neglected. Invoking the fact that i>' at the Newtonian order 
is given by x^^'^KG m /c^) and noting that the expression for dd /dt and da' /dt arise 
from the 1.5PN order differential equation for k, we get 

^' = A' +x^ b\ , (8) 

Gm \ J 



such that A' and B' are given by 

, cos d { ^ . / 

A' = ^ [^1 q XI (si • A) + - X 2 (52 • A)] cos <t>' 

sm^' I q 

+ [<5i ^ XI (si • n) H X2 (S2 • «)] sin , (9a) 

q ' 

B' = -1 q XI (si • A) + ^ X 2 (S 2 • A)]^ . (9b) 

It should be evident that the B' terms appear at the 3PN order while A' terms enter 
O expression at the 1.5PN order. It is not very difficult to infer that these B' terms 
arise due to the non-vanishing / component in the expression for h and therefore 
are unphysical in nature. Therefore, these anomalous terms contribute to the O' 
evolution at the third post-Newtonian order and this is within the consideration of 
higher order spin effects currently available in the literature. It should be noted that 
these unphysical terms play no role in investigations in Ref. [6] that probed the leading 
order spin effects appearing at the 1.5PN order in the phase evolution. 

Another consequence of invoking k to specify the binary orbit is the appearance 
of certain new 1.5PN order contributions to the amplitudes of /z+ and /zx in addition 
to what is provided by Eqs. (A1)-(A3) in Ref. [6]. We recall that Eqs. (A1)-(A3) in 
Ref. [6] provide the fully 1.5PN accurate expressions for /z+ and while invoking 
Ln to describe the binary orbits. The additional amplitude corrections to the GW 
polarization states arise mainly due to the fact that the component of v along k is 
of 1.5PN order. To demonstrate this point, we express r and v = dr jdt in the in- 
ertial frame (jc, j, z) associated with Jq with the help of the three usual Eulerian 
angles O, a and i as displayed in Eig. 1. The relevant expression for r may be writ- 
ten as r = rfi where n = {— sin a cos O — cos i cos a sin 0)Jc -h (cos a cos O — 
cos i sinof sin 0)j -h sin i sin Oz. To show that v can have non-vanishing 1.5PN 
order terms along k, we compute dr jdt in the co-mo ving frame defined by the triad 
(w, ^ = A: X n, A:) and this is easily achieved with the help of three rotations involv- 
ing the three Eulerian angles appearing in the expression for r [12]. The resulting 
expression for v reads 



( JO da \ ^ / di da\ , 

1 cos ^ I f + r I — sm O — sm ^ cos O — I k . 

dt dt / \dt dt / 



( 10 ) 
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It is not very difficult to verify that v • A: / 0 while invoking Eq. (4a) for k to evaluate 
the k component of v. Moreover, the coefficient of k in the above expression for v 
is at the 1.5PN order. The k component of v enters the expressions for and 
through the dot products (p v) (q • v) and (p • v)^ — (q • v)^ that are required to 
compute the PN-accurate expressions for /z+ and h x (the vectors p and q are two unit 
vectors that lay in a plane perpendicular to N). The resulting 1.5PN order amplitude 
corrections to /z x symbolically read 



h X 



1.5PN 



G {i Gm 
R' ^ 



di ^ da 

— ni{i, a, O, 0) H ct, 0) 

dt dt 



( 11 ) 



where the explicit expressions for and K2 are available in Ref. [10]. Therefore, 
the fully 1.5PN order amplitude corrected expression for associated with the 
spinning compact binaries in quasi-circular orbits, described by L, is provided by 
Eq. (A3) along with the above Eq. (11). This statement also requires that the angular 
variables l and a that appear in Eq. (A3) of Ref. [ 6 ] represent k rather than /. Let us 
emphasize again that Eqs. (A2) and (A3) in Ref. [ 6 ] indeed provide the fully 1.5PN 
accurate amplitude corrected /z+ and hx for spinning compact binaries in circular 
orbits described by L^. 

Another aspect where we differ from Refs. [ 6 , 8 ] is the way we specify the two 
spin and k vectors to perform GW phasing. In the literature, it is common to freely 
specify spin vectors in an orthonormal triad defined by using / [5, 6 , 8 ]. In contrast, 
we freely specify the two spin vectors at the initial epoch in the inertial source frame 
associated with Jq. This choice allowed us to specify the x and y components of k 
uniquely in terms of si, S 2 and other intrinsic binary parameters at the initial epoch 
by demanding that the x and y components of / = L -h 5i -h 5^2 should be zero at 
that epoch. Therefore, we extract very easily the initial values of a and l from the 
following expressions for the initial x and y components of k 



kx,i = sin i cos a = 
ky^i = sin z. sin (X = 



G 

c L '\ 
G 

c L{ 



{Xjxi 

ixhi 



sin 0[ 
sin 0[ 



cos (j)[ + X 2 X2 sin $2 <^os 02 } ’ 
sin (l)[ X 2 X 2 sin O 2 sin 02) ’ 



where 0 [ , , ^2 ’ ^2 values of 6 >i , 0 i , 6 ^ 2 , 02 at the initial epoch and L{ denotes 

the PN-accurate expression for |L| at initial orbital frequency. 

Eor isolated unequal mass spinning compact binaries with ^ > 3 it should be 
advantageous to specify their spins in the inertial frame associated with Jq. This is 
because 6>i (xq) that provides the dominant spin orientation from Jq at xq is expected 
to lie in a smaller range for such binaries spiraling into xq due to the emission of 
GWs. A recent study reveals that the dominant BH spin orientation from Jq at xq 
is more likely to be <90° for angular momentum dominated unequal mass binaries 
(|L|(xo) > S\) while Oi(xo) < 45° for unequal mass binaries having S\ > |L| at 
XQ [13]. These inferences require that these binaries inherit spin-orbit misalignments 
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< 160° from various astrophysical processes responsible for the formation of such 
unequal mass spinning binaries. The physical explanation for these conclusions is 
the observed alignment of S\ towards Jq due to the action of gravitational radiation 
reaction, detailed in Ref. [9] . However, it will be difficult to provide similar bounds 
for the dominant spin orientation if the spins are freely specified in a non-inertial 
orbital triad associated with L at xq. In this case k • s\ provides the dominant spin 
orientation at xq and it is not difficult to show that k • s\ remains fairly constant as 
these unequal mass binaries with q > 3 spiral into xq from initial orbital separations 
<1000 Gm/c^ due to the emission of GWs. 
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